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Preface 


What  is  Mathematical  Modelling? 

In  order  to  explain  the  purpose  of  modelling,  it  is  helpful  to  start  by  asking:  what  is 
a  mathematical  modell  One  answer  was  given  by  Rutherford  Aris  [4]: 

A  model  is  a  set  of  mathematical  equations  that  . . .  provide  an  adequate  description  of  a 
physical  system. 

Dissecting  the  words  in  his  description,  “ a  physical  system ”  can  be  broadly  inter¬ 
preted  as  any  real-world  problem — natural  or  man-made,  discrete  or  continuous  and 
can  be  deterministic,  chaotic,  or  random  in  behaviour.  The  context  of  the  system 
could  be  physical,  chemical,  biological,  social,  economic  or  any  other  setting  that 
provides  observed  data  or  phenomena  that  we  would  like  to  quantify.  Being  “ ad¬ 
equate ”  sometimes  suggests  having  a  minimal  level  of  quality,  but  in  the  context  of 
modelling  it  describes  equations  that  are  good  enough  to  provide  sufficiently 
accurate  predictions  of  the  properties  of  interest  in  the  system  without  being  too 
difficult  to  evaluate. 

Trying  to  include  every  possible  real-world  effect  could  make  for  a  complete 
description  but  one  whose  mathematical  form  would  likely  be  intractable  to  solve. 
Likewise,  over-simplified  systems  may  become  mathematically  trivial  and  will  not 
provide  accurate  descriptions  of  the  original  problem.  In  this  spirit,  Albert  Einstein 
supposedly  said,  “ Everything  should  be  made  as  simple  as  possible,  but  not  sim¬ 
pler”  [107],  though  ironically  this  is  actually  an  approximation  of  his  precise 
statement  [34]. 

Many  scientists  have  expressed  views  about  the  importance  of  modelling  and  the 
limitations  of  models.  Some  other  notable  examples  are: 

•  In  the  opening  of  his  foundational  paper  on  developmental  biology,  Alan  Turing 
wrote  “This  [mathematical]  model  will  be  a  simplification  and  an  idealisation, 
and  consequently  a  falsification.  It  is  to  be  hoped  that  the  features  retained  for 
discussion  are  those  of  greatest  importance  ...”  [100] 

•  George  Box  wrote  “ ...all  models  are  wrong,  but  some  are  useful.  ”  [17] 
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•  Mark  Kac  wrote  “ Models  are,  for  the  most  part,  caricatures  of  reality,  but  if 
they  are  good,  they  portray  some  features  of  the  real  world.  ”  [55] 

Useful  models  strike  a  balance  between  such  extremes  and  provide  valuable 
insight  into  phenomena  through  mathematical  analysis.  Every  proposed  model  for  a 
problem  should  include  a  description  of  how  results  will  be  obtained — a  solution 
strategy.  This  suggests  an  operational  definition: 

model:  a  useful,  practical  description  of  a  real-world  problem,  capable  of  providing  sys¬ 
tematic  mathematical  predictions  of  selected  properties 

Models  allow  researchers  to  assess  balances  and  trade-offs  in  terms  of  levels  of 
calculational  details  versus  limitations  on  predictive  capabilities. 

Concerns  about  models  being  “wrong”  or  “false”  or  “incomplete”  are  actually 
criticisms  of  the  levels  of  physics,  chemistry  or  other  scientific  details  being 
included  or  omitted  from  the  mathematical  formulation.  Once  a  well-defined 
mathematical  problem  is  set  up,  its  mathematical  study  can  be  an  important  step  in 
understanding  the  original  problem.  This  is  particularly  true  if  the  model  predicts 
the  observed  behaviours  (a  positive  result).  However,  even  when  the  model  does 
not  work  as  expected  (a  negative  result),  it  can  lead  to  a  better  understanding  of 
which  (included  or  omitted)  effects  have  significant  influence  on  the  system’s 
behaviour  and  how  to  further  improve  the  accuracy  of  the  model. 

While  being  mindful  of  the  possible  weaknesses,  the  positive  aspects  of  models 
should  be  praised, 

Models  are  expressions  of  the  hope  that  aspects  of  complicated  systems  can  be  described  by 
simpler  underlying  mathematical  forms. 

Exact  solutions  can  be  found  for  only  a  very  small  number  of  types  of  problems; 
seeking  to  extend  systems  beyond  those  special  cases  often  makes  the  exact 
solutions  unusable.  Modelling  can  provide  more  viable  and  robust  approaches,  even 
though  they  may  start  from  counterintuitive  ideas,  "...  simple,  approximate  solu¬ 
tions  are  more  useful  than  complex  exact  solutions,,  [15]. 

Mathematical  models  also  allow  for  the  exploration  of  conjectures  and  hypo¬ 
thetical  situations  that  cannot  normally  be  de-coupled  or  for  parameter  ranges  that 
might  not  be  easily  accessible  experimentally  or  computationally.  Modelling  lets  us 
qualitatively  and  quantitatively  dissect  problems  in  order  to  evaluate  the  importance 
of  their  various  parts,  which  can  lead  to  the  original  motivating  problem  becoming  a 
building  block  for  the  understanding  of  more  complex  systems.  Good  models 
provide  the  flexibility  to  be  systematically  developed  allowing  more  accurate 
answers  to  be  obtained  by  solving  extensions  of  the  model’s  mathematical  equa¬ 
tions.  In  summary,  our  description  of  the  process  is 

modelling:  a  systematic  mathematical  approach  to  formulation,  simplification  and 
understanding  of  behaviours  and  trends  in  problems. 
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IX 


Levels  of  Models 


Mathematical  models  can  take  many  different  forms  spanning  a  wide  range  of  types 
and  complexity, 


“Toy”  problems 


< 


Math  Models 


< 


Complete  systems 


At  the  upper  end  of  complexity  are  models  that  are  equivalent  to  the  full 
first-principles  scientific  description  of  all  of  the  details  involved  in  the  entire 
problem.  Such  systems  may  consist  of  dozens  or  even  hundreds  of  equations 
describing  different  parts  of  the  problem;  computationally  intensive  numerical 
simulations  are  often  necessary  to  investigate  the  full  system. 

At  the  other  end  of  the  spectrum  are  improvised  or  phenomenological  “toy” 
problems  that  may  have  some  conceptual  resemblance  to  the  original  system  but 
have  no  obvious  direct  derivation  from  that  problem.  These  might  be  only  a  few 
equations  or  just  some  geometric  relations.  They  are  the  mathematical  modelling 
equivalents  of  an  “artistic  impression”  motivated  or  inspired  by  the  original  prob¬ 
lem.  Their  value  is  that  they  may  provide  a  simple  “proof  of  concept”  prototype  for 
how  to  describe  a  key  element  of  the  complete  system. 

Both  extremes  have  drawbacks:  intractable  calculations  in  one  extreme,  and 
imprecise  qualitative  results  at  the  other.  Mathematical  models  exist  in-between  and 
try  to  bridge  the  gap  by  offering  a  process  for  using  identifiable  assumptions  to 
reduce  the  full  system  down  to  a  simpler  form,  where  analysis,  calculations  and 
insights  are  more  achievable,  but  without  losing  the  accuracy  of  the  results  and  the 
connection  to  the  original  problem. 


Classes  of  Real  World  Problems 

The  kinds  of  questions  being  considered  play  an  important  role  in  how  the  model 
for  the  problem  should  be  constructed.  There  are  three  broad  types  of  questions: 

(i)  Evaluation  questions  [also  called  Forward  problems ]:  Given  all  needed 
information  about  the  system,  can  we  quantitatively  predict  its  other  properties 
and  how  the  system  will  function?  Examples:  What  is  the  maximum  attainable 
speed  of  this  car?  How  quickly  will  this  disease  spread  through  the  population 
of  this  city? 

(ii)  Detection  questions  [Inverse  problems ]  [8]:  If  some  information  about  a 
“black  box”  system  is  not  directly  available,  can  you  “reverse  engineer”  those 
missing  parameters?  Examples:  How  can  we  use  data  from  CAT  scans  to 
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Sometimes  also  described  as  ad  hoc  or  heuristic  models. 
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estimate  the  location  of  a  tumour?  Can  we  determine  the  damping  of  an 
oscillator  from  the  decay  of  its  time  series  data? 

(iii)  Design  questions  [ Control  and  optimisation  problems ]:  Can  we  create  a 
solution  that  best  meets  a  proposed  goal?  Examples:  What  shape  paper  air¬ 
plane  flies  the  furthest?  How  should  a  pill  be  coated  to  release  its  drug  at  a 
constant  rate  over  an  entire  day? 

There  are  many  routes  available  to  attack  such  questions  that  are  typically  treated  in 
different  areas  of  study.  This  book  will  introduce  methods  for  addressing  some 
problems  of  the  forms  (i)  and  (iii)  in  the  context  of  continuous  systems  and  dif¬ 
ferential  equations. 


Stages  of  the  Modelling  Process 

The  modelling  process  can  sometimes  start  from  a  creative  and  inspired  toy 
problem  and  then  seeks  to  validate  the  model’s  connection  to  the  original  problem. 
However,  this  approach  requires  having  a  lot  of  previous  experience  with  and 
background  knowledge  on  the  scientific  area  and/or  relevant  mathematical  tech¬ 
niques  in  order  to  generate  the  new  model.  In  this  book,  we  follow  the  more 
systematic  approach  of  starting  with  some  version  of  the  complete  scientific 
problem  statement  and  then  using  mathematical  techniques  to  obtain  reduced 
models  that  can  be  simplified  to  a  manageable  level  of  computational  difficulty. 

The  modelling  process  has  two  stages,  consisting  of  setting  up  the  problem  and 
then  solving  it: 

•  In  the  formulation  phase ,  the  problem  is  described  using  basic  principles  or 
governing  laws  and  assumed  relations  taken  from  some  branches  of  knowledge, 
such  as  physics,  biology,  chemistry,  economics,  geometry,  probability  or  others. 
Then  all  side-conditions  that  are  needed  to  completely  define  the  problem  must 
be  identified:  geometric  constraints,  initial  conditions,  material  properties, 
boundary  conditions  and  design  parameter  values.  Finally,  the  properties  of 
interest,  how  they  are  to  be  measured,  relevant  variables,  coordinate  systems  and 
a  system  of  units  must  all  be  decided  on. 

•  Then~,  in  the  solution  phase ,  mathematical  modelling  provides  approaches  to 
reformulating  the  original  problem  into  a  more  convenient  structure  from  which 
it  can  be  reduced  into  solvable  parts  that  can  ultimately  be  re-assembled  to 
address  the  main  questions  of  interest  for  the  problem. 

In  some  cases,  the  reformulated  problem  may  seem  to  only  differ  from  the  original 
system  at  a  notational  level,  but  these  changes  can  be  essential  for  separating  out 
different  effects  in  the  system.  At  the  simplest  level,  “problem  reduction”  consists  of 


2 

Assuming  that  the  problem  cannot  be  easily  solved  analytically  or  computed  numerically,  and 
hence  does  not  need  modelling. 
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obtaining  so-called  asymptotic  approximations  of  the  solution,  but  for  more  chal¬ 
lenging  problems,  this  will  also  involve  approaches  for  transforming  the  problem 
into  different  forms  that  are  more  tractable  for  analysis  or  computation. 

The  techniques  described  here  are  broadly  applicable  to  many  branches  of 
engineering  and  applied  science:  biology,  chemistry,  physics,  the  geosciences  and 
mechanical  engineering,  to  name  a  few.  To  keep  examples  compact  and  accessible, 
we  present  concise  reviews  of  background  from  different  fields  when  needed 
(including  population  biology  and  chemical  reactions  in  Chap.  1,  fluid  dynamics  in 
Chap.  2  and  classical  mechanics  in  Chap.  3),  but  we  seek  to  maintain  focus  on  the 
modelling  techniques  and  the  properties  of  solutions  that  can  be  obtained.  We  direct 
interested  readers  to  books  that  present  more  detailed  case  studies  of  problems 
needing  more  extensive  background  in  specific  application  areas  [8,  27,  37,  38,  51, 
69,  96]. 

The  structure  of  this  book  follows  the  description  of  the  modelling  process 
described  above: 

•  Part  I:  Formulation  of  models 

This  part  consists  of  four  chapters  that  present  fundamental  approaches  and 
exact  methods  for  formulating  different  classes  of  problems: 

1.  Rate  equations:  simple  models  for  properties  evolving  in  time 

2.  Transport  equations:  models  involving  structural  changes 

3.  Variational  principles:  models  based  on  optimisation  of  properties 

4.  Dimensional  analysis:  determination  of  the  number  of  essential  system 
parameters 

•  Part  II:  Solution  techniques 

This  part  presents  methods  for  obtaining  approximate  solutions  to  some  of  the 
classes  of  problems  introduced  in  Part  I. 

5.  Similarity  solutions:  determining  important  special  solutions  of  PDEs  using 
scaling  analysis 

6.  Perturbation  methods:  exploiting  limiting  parameters  to  obtain  expansions 
of  solutions 

7.  Boundary  layers:  constructing  solutions  having  non-uniform  spatial 
structure 

8.  Long- wave  asymptotics:  reduction  of  problems  on  slender  domains 

9.  Weakly-nonlinear  oscillators:  predicting  cumulative  changes  over  large 
numbers  of  oscillations 

10.  Fast/slow  dynamical  systems:  separating  effects  acting  over  different 
timescales 

11.  Reduced  models:  obtaining  essential  properties  from  simplified  versions  of 
partial  differential  equation  problems 

•  Part  III:  Case  studies:  some  applications  illustrating  uses  of  techniques  from 
Parts  I  and  II. 
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While  this  book  cannot  be  an  exhaustive  introduction  to  all  types  of  mathematical 
models,  we  seek  to  develop  intuition  from  the  ground-up  on  formulating  equations 
and  methods  of  solving  models  expressible  in  terms  of  differential  equations. 

The  book  is  written  in  a  concise  and  self-contained  form  that  should  be 
well-suited  for  an  advanced  undergraduate  or  beginning  graduate  course  or  inde¬ 
pendent  study.  Students  should  have  a  background  in  calculus  and  basic  differential 
equations.  Each  chapter  provides  references  to  sources  that  can  provide  more  detail 
on  topics  that  readers  wish  to  pursue  in  greater  depth.  We  note  that  the  examples 
and  exercises  are  an  important  part  of  the  book  and  will  introduce  readers  to  many 
classic  models  that  have  become  important  milestones  in  applied  mathematics  for 
illustrating  important  or  universal  solution  structures.  Some  of  these  highlights 
include: 

•  The  Burgers  equation  (Chaps.  2,  4,  5) 

•  The  shallow  water  equations  (Chaps.  2,  4,  6) 

•  The  porous  medium  equation  (Chaps.  5,  8,  11) 

•  The  Korteweg  de  Vries  (KdV)  equation  (Chaps.  8,  9) 

•  The  Fredholm  alternative  theorem  (Chap.  9) 

•  The  van  der  Pol  equation  (Chaps.  9,  10) 

•  The  Michaelis-Menten  reaction  rate  model  (Chap.  10) 

•  The  Turing  instability  mechanism  (Chap.  11) 

•  Taylor  dispersion  (Chap.  11) 

Solutions  are  provided  to  many  exercises.  Readers  are  encouraged  to  work  through 
the  exercises  in  order  to  gain  a  deeper  understanding  of  the  techniques  presented. 
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Part  I 

Formulation  of  Models 


Chapter  1 

Rate  Equations 


Many  real-world  systems  can  be  described  in  terms  of  the  evolution  of  state  variables 
for  various  system  properties,  starting  from  given  initial  configurations.  The  sim¬ 
plest  models  for  such  problems  are  given  by  ordinary  differential  equations  (ODE) 
describing  the  rates  of  change  of  the  state  variables  as  functions  of  time. 
Applications  include: 

•  Mechanics:  motion  of  masses  subjected  to  forces 

•  Modern  physics:  radioactive  decay  of  materials 

•  Statistical  systems:  queues,  games,  multi-stage  processes 

•  Chemistry/Biochemistry:  chemical  reactions 

•  Biology:  epidemic  models  for  diseases  in  populations 

•  Ecology:  dynamics  of  populations  of  predator  and  prey  species 

While  the  dynamics  in  some  of  these  applications  may  rest  on  discrete  events,  like 
the  radioactive  decay  of  an  atom  or  the  death  of  an  individual  in  a  population,  when 
averaged  over  large  populations,  reliable  mean-rates  of  activities  can  occur;  this  is 
the  basis  for  writing  ODE  rate  equations  and  other  continuous  models.  ODEs  cannot 
describe  all  of  the  details  of  the  processes  occurring  in  these  systems,  but  they  provide 
a  good  starting  point  for  investigations,  often  yielding  accurate  predictions  of  various 
phenomena. 

For  a  system  involving  n  evolving  properties,  the  governing  mathematical  problem 
is  typically  written  in  the  form 

t),  x(0)=x0.  (1.1) 

dt 

where  the  vector  of  state  variables,  x(t)  =  (x\(t),  X2(t), . . . ,  xn(t))  e  M",  describes 
n  properties  of  interest  in  the  system,  evolving  for  times  t  >  0  and  starting  from  a 
given  initial  state  x(0)  =  xo.  The  rate  functions  for  the  rates  of  change  of  each  jc/  , 
dxi  /dt  =  //,  have  similarly  been  collected  in  a  vector,  f  =  (/i,  fi,  . . . ,  /«),  where 
each  f  can  potentially  depend  on  all  of  the  state  variables. 
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1  Rate  Equations 


System  (1.1),  where  the  rate  functions  have  an  explicit  dependence  on  time,  is 
called  non- autonomous .  In  such  systems,  the  solutions  depend  on  the  particular 
details  of  externally  imposed  time-dependent  influences.  In  this  chapter,  we  will 
focus  on  autonomous  systems , 

~j~  =  f(x),  x(0)=xo,  (1.2) 

where  the  rates  only  depend  on  the  current  state  of  the  solution.  These  systems 
describe  “natural”  or  unforced  behaviour  innate  to  the  system  and  are  the  traditional 
starting  points  for  studying  new  classes  of  problems.  In  later  chapters  we  will  return 
to  the  influence  of  inhomogeneous  forcing  with  applications  such  as  control  theory 
and  driven  systems. 

Problems  that  can  be  stated  in  the  form  (1.2)  are  called  dynamical  systems ,  where 
the  independent  variable  t  usually  represents  time,  but  may  also  represent  other 
properties  depending  on  the  context  of  the  system.  The  most  fundamental  issues  to 
be  addressed  for  a  given  dynamical  system  are 

•  Dynamics:  What  is  the  behaviour  of  the  solution  for  t  >  0  starting  from  given 
initial  conditions  xo?  (i.e.  Will  the  solution  be  oscillatory,  or  monotone  increasing, 
or  exponentially  decaying?  Do  solutions  of  certain  types  exist  for  any  xo?  Does  a 
unique  solution  exist?....) 

•  Stability:  Can  the  system’s  behaviour  be  predicted  starting  from  broad  sets  of  initial 
conditions?  (i.e.  Do  the  solutions  starting  from  other  initial  conditions  follow  the 
behaviour  of  the  solution  starting  from  xo?) 

In  the  simplest  cases  (in  dimensions  n  =  1  and  n  =  2),  the  complete  behaviour  of 
systems  can  be  understood  from  local  properties  of  the  rate  functions  and  geometric 
descriptions  of  the  set  of  solutions.  This  can  yield  a  complete  characterisation  of 
the  dynamics  without  the  need  to  attempt  to  explicitly  construct  the  solutions.  For 
n  =  2,  this  geometric  approach  is  called  phase  plane  analysis  and  forms  the  basis 
of  (general)  dynamical  systems  theory. 


1.1  The  Motion  of  Particles 


A  classic  example  of  a  dynamical  system  from  mechanics  is  the  system  for  motion  of 
a  particle,  where  the  unknowns  describing  the  state  of  the  particle  are  its  position,  X, 
and  momentum,  P.  The  rate  of  change  of  position  is  the  velocity,  and  from  Newton’s 
second  law  [67,  91],  the  rate  of  change  of  momentum  is  given  by  the  net  applied 
force,  F,  so  we  can  write 


dX  d  P 

—  =  V,  — 

dt  dt 


(1.3) 


where  P  =  mV  and  the  applied  forces  could  depend  on  position  and  momentum.  If 
the  mass  is  constant,  then  these  equations  can  be  combined  to  give  the  familiar  law 
of  motion  linking  mass  times  acceleration  to  the  net  applied  force, 
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d2X 

YYl -  =  F 

dt 2 


(1.4) 


Note  that  every  nth  order  autonomous  differential  equation  can  always  be  re-expressed 
in  the  form  of  a  system  of  n  (first-order)  rate  equations.  Defining  x  =  (X\,  X2,  X3,  Pi, 
P2,  P3),  system  (1.3)  can  be  put  in  form  (1.2)  as 


dx  I  V/m  \  f. 

—  =f(x)  with  f  =  (  F(x,P)  )  eR  •  (1-5) 

Newton’s  laws  give  a  well-defined  procedure  for  constructing  the  dynamical  system 
for  a  mechanics  problem.  For  other  fields,  different  principles  provide  guidance;  we 
will  review  how  to  set  up  rate  equations  for  problems  in  chemistry  and  biology. 


1.2  Chemical  Reaction  Kinetics 


For  chemical  systems  [6],  the  fundamental  principle  for  translating  chemical  reac¬ 
tions  into  corresponding  sets  of  rate  equations  is  given  by  the  law  of  mass  action . 

In  “simple”  (or  elementary)  reactions,  generically  of  the  form 

k 

reactants  ->  products,  (1.6) 


the  rate  of  creation  of  products  depends  on  the  concentrations  of  available  reactants 
and  is  also  characterised  by  a  rate  constant  k.  The  rate  of  consumption  of  reactants 
also  follows  from  this  relation. 

We  will  denote  the  concentration  of  chemical  ‘A’  by  A(t)  >0,  and  the  total  rate 
of  production  of  A  will  have  contributions  from  its  creation  and/or  consumption  due 
to  each  chemical  reaction  involving  A, 


dA 

dt 


N  N 

+  (creation  rate)/7  —  (consumption  rate)/7. 

n = 1  n  —  1 


(1.7) 


The  rate  equations  form  a  system  of  first-order  equations,  one  for  each  chemical 
in  the  set  of  reactions,  {A,  B,  C,  . . .},  and  the  rate  functions  will  be  polynomials  in 
terms  of  products  of  concentrations  of  the  reactant  chemicals. 

We  briefly  summarise  the  basic  forms  of  elementary  chemical  reactions  and  the 
corresponding  rate  equations  that  follow  from  the  law  of  mass  action: 


An  Chap.  3  we  will  consider  a  different  approach. 
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1  Rate  Equations 


(0)  Constant  supply:  compound  A  is  pumped  into  the  system  at  a  constant  rate  k 


k  a  A 

(source)  ->  A  =>►  —  =  k.  (1.8) 

dt 

This  is  called  a  zeroth-order  reaction  since  the  rate  does  not  depend  on  the 
concentration  of  any  reactants,  k  =  k  •  1  =  kA°. 

(i)  Decay:  substance  A  transforms  into  waste  at  rate  k  (i.e.  A  decomposes  and  is 
removed  from  the  system) 

k  dA 

A  — >  (waste)  =>►  —  =  —kA.  (1.9) 

dt 

This  is  called  a  first-order  reaction  since  the  reaction  rate  depends  linearly  on 
the  concentration  of  a  single  reactant. 

(ii)  Transformation :  A  is  consumed  with  B  being  produced  from  A 

k  dA  dB 

A  ->  B  =>  —  =  -kA,  —  =  +ifcA.  (1.10) 

dt  dt 

This  is  the  simplest  reaction  “system”,  having  two  distinct  concentrations  evolv¬ 
ing  due  to  a  single  reaction. 

(iii)  Reversible  transformation:  A  transforms  into  B  and  vice  versa.  Such  reactions 
should  be  explicitly  expanded  into  separate  forward  and  reverse  reactions. 

B  =  {  A  X  B  and  A  X-  B  } 

dA  dB 

—  =  —  k\A  +  k2B,  — =  k\A  —  k2B.  (1.11) 

dt  dt 

In  this  system,  the  net  rate  of  production  of  each  substance  is  obtained  by 
summing  the  reaction  rate  from  the  reaction  producing  it  minus  the  rate  from 
the  reaction  consuming  it,  as  in  (1.7). 

(iv)  Compound  formation:  A  and  B  combine  to  form  C 


A  +  B  X  C  =>- 

dA  dB  dC 

—  =  - kAB ,  —  =  —kAB,  —  =  kAB.  (1.12) 

dt  dt  dt 

The  production  rate  of  C  being  proportional  to  the  product  of  the  reactant 
concentrations  follows  from  a  probabilistic  description  of  the  collision  of  inde¬ 
pendent  molecules  [6].  The  probability  of  forming  a  molecule  of  C  increases 
when  either  of  the  concentrations  of  A,  B  increases  (and  clearly  the  reaction 
will  not  proceed  if  either  is  absent,  A  =  0  or  B  =  0). 

However,  suppose  A  and  B  are  the  same  chemical,  then  from  (1.12) 


k  i 


1.2  Chemical  Reaction  Kinetics 


7 


a  +  aXc  =  2 aX  c  =>■ 

—  =+kA2  (but  — y^-kA2  (?))  (1.13) 

dt  \  dt  ) 

The  equation  for  the  rate  of  consumption  of  A  cannot  be  correct  because  it 
implies  that  A,  C  have  equal  but  opposite  rates  of  change,  while  we  know 
that  formation  of  each  molecule  of  C  should  consume  two  molecules  of  A. 
This  is  an  issue  with  “double-counting”  that  points  to  the  need  for  a  more 
precise  definition  of  the  “reaction  rate,”  as  will  be  addressed  in  the  final  type  of 
elementary  reaction. 

(v)  Multiple  products:  n  molecules  of  A  and  m  molecules  of  B  react  to  produce 
p  molecules  of  C  and  q  molecules  of  D : 

n  A  +  m  B  p  C  +  q  D  (1.14a) 

The  problem  associated  with  (1.13)  is  resolved  by  defining  the  reaction  rate  to 
be 


Reaction  Rate  =  —rate  of  consuming  one  unit  of  reactant 

=  +rate  of  creating  one  unit  of  product. 


Then  for  (1.14a)  we  get 


1 dC  IdD 

Rate  = - = - 

p  dt  q  dt 


and  can  write  the  rate  equations  as 


1  dA 
n  dt 


1  dB 
m  dt 


kAnBm 


dA 

dB 

reactants:  — 

=  —nkAnBm 

=  —mkAnBm 

(1.14b) 

dt 

dt 

products: 

dC  =pkAnBm 
dt  F 

d°  =qkAnBm 
dt  H 

(1.14c) 

Consequently,  the  rate  equation  for  A  for  the  reaction  (1.13)  is  now  correctly 
given  by 


dA 

dt 


(U5) 


For  systems  without  losses  or  sources  of  chemicals,  as  in  the  case  of  (1.10)  or 
(1.11),  physical  expectations  based  on  the  conservation  of  mass  suggest  that  A  +  B  = 
constant,  which  is  validated  by  evaluating  ^  (A  +  B)  using  the  rate  equations.  This  is 
called  a  conservation  law  linking  the  products  and  reactants.  Typically,  the  value  for 
the  constant  is  set  by  the  summed  initial  concentrations  and  can  be  used  to  express 
one  concentration  in  terms  of  the  other.  For  example,  if  A  +  B  =  constant,  then 
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1  Rate  Equations 


B(t )  =  (Ao  +  Bo)  —  A(0,  allowing  us  to  reduce  the  number  of  unknowns  in  the 
system.  In  some  cases  there  may  be  various  combinations  of  reactants/products  that 
can  be  used  to  write  a  conservation  law.  For  example,  for  (1.14a),  one  possible  form 
of  mass  balance  is  given  by 


d 

dt 


1.3  Ecological  and  Biological  Models 

Rate  equations  are  also  widely  used  for  predicting  population  growth  and  the  spread 
of  epidemics  (namely,  the  growth  of  the  population  of  diseased  individuals)  [18,  74]. 

The  simplest  model  of  population  growth,  called  Malthus’  law,  describes  a  growth 
rate  proportional  to  the  current  population,  and  generates  exponential  growth, 

—  =kA  =>•  A(t)  =  A0ekt.  (1.16) 

dt 

In  analogy  with  the  description  of  chemical  reactions,  using  (1.14a)  the  growth  of 
the  population  would  be  due  to  additions  by  births,  with  the  parent  remaining  in  the 
population, 


a4a  +  A  =*  —  =  ~PA  +  2fiA  =  fiA.  (1.17) 

dt 

We  note  that  the  rate  constant  /3  would  incorporate  the  time  needed  for  the  birth 
process  (the  gestation  period)  as  well  as  the  fraction  of  the  total  population  who 
choose  to  become  parents.  More  generally,  the  net  rate  of  population  growth  would 
be  the  difference  between  the  birth  rate  /3  and  the  death  (decay)  rate  8,  k  =  /3  —  8. 

The  weakness  of  Malthus’  law  with  respect  to  predicting  unlimited  growth  of 
populations  lead  to  an  improved  model  developed  by  Pierre  Verhulst  (1804-1849), 
usually  called  the  logistic  equation , 

—  =  (P-8)A-yA2  =  (k-yA)A,  (1.18) 

dt 

where  the  coefficient  y  scales  the  influence  of  competition  effects  in  decreasing 
the  birth  rate  (and/or  increasing  the  death  rate)  in  growing  populations.  Here,  the 
effective  growth  rate,  k(A )  =  k  —  yA,  depends  on  the  population  size  and  changes 
the  dynamics  from  growth  (k  >  0)  for  small  populations  to  decay  (k  <  0)  for  large 
populations.  The  borderline  between  these  two  cases  defines  a  critical  population 
size,  A*  =k/y,  called  the  carrying  capacity,  which  allow  (1.18)  to  be  written  as 
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The  logistic  equation  is  the  simplest  model  that  makes  progress  in  capturing  the 
influence  of  limited  external  resources  (e.g.  food)  on  growth  of  the  population. 

A  more  detailed  way  to  describe  the  coupling  of  populations  to  their  use  of  re¬ 
sources  is  to  introduce  additional  rate  equations  to  also  describe  the  growth/decay 
of  the  resources.  Consider  a  population  of  rabbits  whose  number  evolves  according 
to  the  logistic  equation  (1.18).  Let  that  population  of  rabbits,  A(t ),  also  serve  as  the 
food  supply  (prey)  for  a  population  of  foxes  (predators),  B(t).  Let  the  foxes  have  a 
constant  death  rate  and  reproduce  only  when  food  available,  analogous  to  a  chemical 
reaction  of  the  form,  A  +  B  — >  B  +  B.  The  influence  of  the  consumption  of  prey 
by  predators  is  called  predation  and  generates  an  additional  loss  term  in  the  logistic 
equation  (1.19)  for  the  rabbit  population.  Along  with  the  rate  equation  for  the  fox 
population,  this  yields  a  Lotka-Volterra  predator-prey  system  [45,  74], 


dA  9  dB 

—  =  (/3-S)A-  yA2  -  pAB,  —  =  -kB  +  oAB.  (1.20) 

dt  dt 


In  the  context  of  the  spread  of  diseases  [18],  simple  models  of  epidemics  divide 
the  total  population  into  sub-groups  depending  on  whether  individuals  are  infected 
(7(0),  susceptible  to  the  disease  ( S(t ))  or  recovering  from  the  disease  ( R(t )).  The 
transitions  between  these  states  can  be  interpreted  as  reactions 


/  +  $-►/  +  /,  I  ^  R, 


yielding  a  basic  version  of  what  are  generally  called  SIR  models 

dS  dl  dR 

—  =  -kSI,  —  =kSI  -  yl ,  —  =  yl .  (1.21) 

dt  dt  dt 

This  particular  system  conserves  the  total  population,  N  =  S  +  I  +  R,  but  other 
formulations  can  allow  for  growing  or  declining  overall  populations.  Diseases  for 
which  immunity  is  not  achievable  can  be  described  by  reversible  transitions  between 
susceptible  and  infected  states,  analogous  to  (1.11), 

h  dS  dl 

S  I  =4>  — -  =  —  k\S  +  k^I,  —  =k\S  —  fyl,  (1.22) 

k2  dt  dt 

and  are  generally  called  SIS  models.  Many  further  extensions  are  possible,  including, 
for  example,  subdividing  the  infected  population  into  individuals  that  are  in  an  earlier, 
exposed  phase,  E(t ),  versus  a  later  infectious  phase,  /  (t),  called  SEIR  models  [19]. 
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1  Rate  Equations 


1.4  One-Dimensional  Phase-Line  Dynamics 


The  simplest  applications  described  above  were  expressed  as  initial  value  problems 
for  a  single  autonomous  ODE  (e.g.  (1.15),  (1.19)), 


dx 

dt 


=  fix). 


(1.23) 


If  the  rate  function  f(x)  is  linear  or  a  polynomial  function  of  x,  then  various  methods, 
such  as  separation  of  variables  can  be  used  to  determine  the  exact  solution.  However, 
we  will  see  that  many  problems  will  yield  more  complicated  rate  functions  where 
the  solution  cannot  be  explicitly  calculated.  For  this  reason,  it  is  very  useful  to  have 
a  general  theory  that  provides  an  understanding  of  all  solutions  of  (1.23)  without 
having  to  find  explicit  solutions.  Further,  even  when  a  solution  can  be  written  out,  it 
may  be  complicated  to  analyse.  In  contrast,  there  is  a  qualitative  approach  that  can 
be  easier  to  calculate  and  to  interpret. 

The  dynamics  of  the  solutions  of  (1.23)  can  be  described  by  a  one-dimensional2 3 
“phase  line”  in  terms  of  the  graph  of  the  function  f(x)  (see  Fig.  1.1  (left)).  Assuming 
that  f(x)  is  a  nice  function  the  behaviour  of  all  solutions,  x(t),  can  be  described 
qualitatively  in  relation  to  the  properties  of  f(x)  at  its  zeroes.  The  values  v  =  v* 
where  f(x*)  =  0  are  called  equilibrium  points  of  (1.23).  The  values  v*  define  steady 
state  solutions  since  where  dx/dt  =  0,  solutions  starting  at  v*  remain  there  forever. 
The  behaviour  of  solutions  starting  from  within  a  small  neighbourhood  around  v* 
can  be  analysed  by  approximating  fix)  by  its  Taylor  series  about  v  =  x*, 

fix )  %  fix*)  +  f\x*){x  -  x*)  +  \  f"{x*){x  -  x *)2.  (1.24) 


Writing  the  separation  from  the  equilibrium  point  as  u  (t)  =  x(t)—x*,  (1.24)  becomes 

fix)  ~  0  +  au  +  bu 2  +  cu 3  +  •  •  •  as  u  0,  (1-25) 


where  a  =  f\x*).  If  a  ^  0,  then  as  u  — >  0  (corresponding  to  x  — >  **),  (1.23)  can 
be  approximated  by  the  linearised  equation, 

du  nt 

—  —  au  =>►  u(t)  ~  Ce  1 ,  (1.26) 

dt 

and  hence  yields  x(t)  ~  v*  +  Ceat  with  C  ^  0  for  any  solution  not  starting  exactly 
at  the  equilibrium  point.  For  a  >  0  the  equilibrium  is  locally  ‘repelling’  since  the 
separation  from  the  equilibrium  point  increases  with  time,  while  conversely  for  a  <  0 
the  separation  decreases  with  time  and  the  equilibrium  is  locally  ‘attracting’.  More 
generally,  if  any  solution  starting  near  an  equilibrium  point  leaves  the  neighbourhood 


2 Here  one-dimensional  indicates  that  the  dynamics  of  solutions  can  be  understood  in  terms  of  a 
single  variable,  x. 

3  With  /  being  bounded  and  sufficiently  smooth. 
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Fig.  1.1  {Left)  A  smooth  rate  function  f(x)  with  three  equilibrium  points,  (Right)  corresponding 
dynamics  on  the  phase  line  for  (1.23)  obtained  from  local  properties  of  /  at  its  equilibrium  points 


of  x*  as  t  — >>  oo,  then  x*  is  called  asymptotically  unstable ,  while  if  all  solutions 
starting  within  the  neighbourhood  approach  x*  as  t  — >  oo  then  the  equilibrium  is 
called  asymptotically  stable.  The  characterisation  of  the  dynamics  of  the  solutions 
near  x*  based  on  the  linearised  equation  (1.26)  is  commonly  called  linear  stability 
analysis. 

In  graphical  terms,  x*  is  stable  if  the  slope  /'(x*)  is  negative  (a  <  0)  and  unstable 
if  the  slope  is  positive  (a  >  0).  If  we  consider  x(t)  to  be  the  position  of  a  point  moving 
along  the  x-axis  in  Fig.  1.1  (the  phase  line)  then  this  follows  from  /’ s  role  as  the 
velocity,  or  rate  of  change  of  position  x.  If  x  starts  to  the  right  of  x*3  (xo  >  **3) 
where  /(x)  <  0  then  x(t)  will  decrease  (move  to  the  left,  dx/dt  <  0)  back  towards 
v*3.  In  contrast  if  xo  >  x*2  where  f(x)  >  0  then  x(t)  would  increase  with  time 
(< dx/dt  >  0),  moving  away  from  x*2. 

Linear  stability  results  provide  guidance  to  understanding  the  global  dynamics 
(not  limited  to  small  neighbourhoods  of  the  x*’s).  Since  /  changes  sign  only  at  its 
zeroes  (where  we  have  assumed  f'(x *)  to  be  non-zero),  solutions  starting  at  xo  within 
intervals  between  zeroes  will  either  be  monotone  increasing  (moving  to  the  right  on 
the  phase  line)  or  decreasing  (moving  to  the  left  on  the  phase  line)  depending  on  the 
sign  of  /.  This  is  consistent  with  the  results  of  the  local  stability  analysis  at  each  x* 
being  controlled  by  the  sign  of  f'(x *). 

The  case  where  a  =  f'(x *)  =  0  is  called  a  degenerate  equilibrium  point;  the 
local  analysis  for  u  — >  0  is  still  addressed  using  the  Taylor  series  (1.25),  but  now  the 
first  nontrivial  term  in  the  expansion  is  (at  least)  quadratic  and  so  the  rates  of  growth 
or  decay  will  be  algebraic  rather  than  exponential.  If  /"(x*)  7^  0,  then  the  ODE 


du 

dt 


=  bu 


2 


(1.27) 


can  be  used  to  show  that  a  degenerate  equilibrium  point  x*  is  unstable  for  any  b  7^  0. 
Similarly,  for  the  case  with  /'(x*)  =  /"(x*)  =  0,  but  /"'(x*)  7^  0  yielding 
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Fig.  1.2  The  graph  of 
y  —  f{x)  —  h(x)  —  k  for 
(1.29),  giving  the  dynamics 
for  x(t)  on  different  phase 
lines,  graphically 
parametrised  by  y  —  k 


x 


dU  n 

—  =  CU  , 
dt 


(1.28) 


a  third-order  degenerate  equilibrium  point  can  be  shown  to  be  stable  if  f"'(x*)  <  0. 

The  dependence  of  solutions  on  parameters  in  the  system  will  be  an  important 
aspect  of  many  problems.  Consider  the  ODE 

dx  o 

—  =  h(x)  —  k  where  h(x)  =  x  —  x  ,  (1-29) 

dt 


where  k  is  a  parameter.  Matching  to  (1.23),  we  identify  the  rate  function  as  f(x)  = 
h(x)  —  k.  Plotting  f(x)  in  Fig.  1.2,  we  illustrate  how  the  family  of  problems  (1.29) 
parametrised  by  k  can  be  understood  from  this  graph.  Observe  that  for  different  values 
of  the  constant  k ,  the  horizontal  lines  y  =  k  form  a  “stacked”  set  of  phase  lines  cutting 
through  the  curve  y  =  h(x)  at  the  equilibrium  points,  fix *)  =  0,  corresponding 
to  that  value  of  k.  The  positions  of  the  equilibrium  are  functions  of  k,  but  more 
significantly,  the  number  and  type  of  the  equilibrium  points  change  for  different 
ranges  of  k ,  such  qualitative  changes  in  the  structure  of  sets  of  solutions  are  called 
bifurcations.  For  (1.29),  the  dynamics  for  all  times  is  restricted  to  a  single  phase  line, 
with  y  =  k  fixed,  in  the  next  section  we’ll  review  systems  with  coupling  to  a  second 
rate  equation  for  y(t)  yielding  general  motion  in  the  vy  plane.  Despite  its  restricted 
form,  we  will  see  that  problems  like  (1.29)  occur  as  pseudo-two-dimensional  phase 
plane  structures  in  reductions  of  more  complicated  systems  in  Chap.  10. 


1.5  Two-Dimensional  Phase  Plane  Analysis 

For  systems  of  two  coupled  autonomous  rate  equations  (called  phase  plane  systems) 
the  approach  of  the  previous  section  can  be  extended  to  similarly  give  a  qualitative 
understanding  of  all  solutions  from  just  local  properties  of  the  rate  functions. 
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For  n  =  2  with  x(t)  =  ( x(t ),  y(t)),  Eq.  (1.2)  can  be  written  as 


dx 

dt 


dy_ 

dt 


=  g(x,y), 


with  initial  conditions  at  t  =  0, 


x(0)=x0,  y(0)  =  y0. 


(1.30a) 


(1.30b) 


Interpreting  variable  t  as  time,  this  problem  can  be  interpreted  as  describing  the 
motion  of  a  point  in  the  xy  “phase  plane”  starting  from  position  xo  =  (xo,  To) 
subject  to  a  position-dependent  velocity  v  =  (/,  g).  The  solution  of  (1.30a,  1.30b) 
is  a  parametric  curve  (or  trajectory )  passing  through  the  point  xo. 

Using  the  chain  rule  to  eliminate  t  from  y(x)  =  y(x(t )),  (1.30a)  leads  to  the 
associated  ODE  (also  called  the  slope  field  equation ), 


dy_  _  g(x,  y) 
dx  /(x,y )' 


(1.31) 


Solving  this  first  order  ODE  for  y(x)  yields  a  family  of  implicit  solutions  called  the 
integral  curves  that  trace  out  the  same  paths  in  the  plane  as  the  trajectories.  The 
implicit  solutions  give  only  the  shape  of  the  curves  while  parametric  solutions  also 
give  the  direction  of  motion  on  the  curves  with  respect  to  increasing  t. 

Analogous  to  the  analysis  of  the  phase  line,  equilibrium  points  ( x * ,  y *)  are  defined 
by  positions  where  both  rate  functions  vanish, 


/(x*,y*)  =  0,  g(x*,y*)=0.  (1.32) 


With  the  exception  of  equilibrium  points,  results  on  uniqueness  of  solutions  ensure 
that  each  point  xo  has  a  single  solution  curve  passing  through  it  and  that  those  curves 
cannot  cross.  Consequently,  assuming  /  and  g  are  smooth,  the  solutions  are  smooth 
curves  everywhere  in  the  phase  plane  except  at  the  equilibrium  points.  Observe 
from  (1.31)  that  at  an  equilibrium  point  the  slope  dy /dx  is  indeterminate  (formally 
dy /dx  =‘0/0’)  and  requires  more  careful  analysis  to  describe  the  local  structure  of 
solutions. 

Near  an  equilibrium  point,  define  u(t)  =  (x(t)—x*,y(t)  —  y*)satisfyingdu/dt  = 
f(u  H-  x*),  and  using  a  multi- variable  Taylor  series  approximation  of  f  for  |u|  — >  0 
yields  the  linearised  system 

du 

—  =Au,  (1.33) 

dt 

where  the  matrix  A  is  the  Jacobian  (or  gradient  Vf),  evaluated  at  (x*,  y*), 


A  =  J(x*,  y*) 


Ac/(**,  y*)  dyf(x*,  y*)\ 

\dxg dyg(x*,y*)J 


(1.34) 
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Seeking  solutions  of  the  form  u(7)  =  veXt  then  reduces  (1.33)  to  the  matrix  eigen¬ 
value  problem, 

Av  =  Av. 


At  each  equilibrium  point,  linear  algebra  yields  the  eigenvalues  as  the  roots  of  the 
characteristic  polynomial  obtained  from  the  setting  the  determinant  to  zero, 

|A  —  AI|  =0. 

Subsequently,  for  each  eigenvalue,  the  eigenvectors  can  be  obtained  by  row- 
reductions  as  the  nontrivial  null  vector  of  (A  —  A^I)v^  =  0.  The  general  solution 
of  the  linearised  system  is  then  given  by  the  linear  combination  of  the  eigenmodes, 

x(t)  ~  x*  +  c\\\eXlt  +  C2V2^2?  for  |x  —  x* |  — >►  0.  (1.35) 

If  A  does  not  have  a  complete  set  of  eigenvectors  (a  possibility  with  repeated  eigen¬ 
values)  then  this  form  must  be  modified. 

Extending  the  discussion  of  asymptotic  stability  from  Sect.  1.4  in  terms  of  all 
solutions  approaching  x*  or  any  diverging  from  it,  the  stability  of  solutions  starting 
near  x*  can  be  understood  in  terms  of  the  eigenvalues,  see  Table  1.1.  Problems  having 
Re(A)  =0  fall  into  a  degenerate  case,  called  a  centre  manifold,  and  must  be  studied 
more  carefully,  somewhat  like  (1.27,  1.28). 

While  for  the  phase  line  case  the  behaviour  of  solutions  could  be  inferred  directly 
from  the  slope  of  the  rate  function,  A  =  a  =  the  geometry  for  the  phase  plane 

case  is  more  complicated.  The  stability  properties  are  still  set  by  the  derivatives  of 
the  rate  functions  at  x*  but  now  the  different  cases  are  most  conveniently  expressed 
in  terms  of  the  eigenvalues  of  A,  (1.35).  The  local  geometry  of  the  integral  curves 
near  non-degenerate  x*  is  then  given  by  the  cases  outlined  in  Table  1.2,  where  the 


Table  1.1  Asymptotic  stability  of  equilibrium  point  x*  in  terms  of  eigenvalues  from  the  linear 
stability  analysis 


A’s 

Stability 

t  00 

Both  Re(A)  <  0 

Stable 

o 

t 

'ss 

Either  Re(A)  >  0 

Unstable 

u(f)l  — ►  oo 

Table  1.2  Geometry  of  trajectories  near  equilibrium  point  x*  in  terms  of  the  eigenvalues 


A’s 

Name 

Geometry 

Stability 

k2  same  sign 

Node 

Rays 

Re(A) 

Ai,  A.2  opp.  signs 

Saddle 

Hyperbolas 

Unstable 

•  ^ 

II 

<< 

Centre 

Circles 

Neutral 

A  =  a  ±  i/3 

Spiral 

Spirals 

Re(A) 
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eigenvectors  set  the  orientation  of  the  geometry  (most  notably  the  directions  of  the 
stable  and  unstable  axes  of  a  saddle  point). 

If  f(x,  y )  and  g(v,  y)  are  linear  functions,  then  there  will  be  a  single  equilibrium 
point  and  the  above  analysis  describes  the  form  of  the  entire  phase  plane.  Otherwise, 
this  gives  a  description  in  a  neighbourhood  surrounding  each  x*.  However,  using 
the  results  on  uniqueness  of  trajectories  (i.e.  curves  may  cross  only  at  equilibrium 
points),  trajectories  can  be  smoothly  extended  and  a  global  structure  can  be  sketched 
by  piecing-together  the  local  linearised  geometries  around  each  of  the  equilibrium 
points. 

In  addition  to  systems  of  rate  equations,  phase  plane  analysis  can  also  be  applied 
to  second-order  autonomous  ODEs  for  x(t),  such  as 


d2x 
dt 2 


(1.36) 


which  can  be  written  as  an  equivalent  system  by  defining  an  intermediate  variable 
y(t): 


dx 

dt 


dy_ 

dt 


=  g(x,  y). 


(1.37) 


In  this  system,  the  relationship  x'  =  y  defines  the  direction  of  trajectories  as  follows: 

•  x'  >  0  :  x(t)  increasing  (— >)  for  y  >  0  (upper  half  vy  plane) 

•  x'  <  0  :  x(t)  decreasing  («— )  for  y  <  0  (lower  half  vy  plane) 

This  choice  for  the  intermediate  variable  is  not  unique  but  has  a  nice  physical  in¬ 
terpretation  in  term  of  (x,  y)  =  (position,  velocity)  with  the  v-axis  being  states  at 
rest  (zero  velocity).  An  example  of  a  different  choice  for  y  (t)  will  be  given  in  a  later 
chapter. 

As  an  example,  consider  the  pendulum  equation  for  the  angular  position  of  a 
suspended  mass  swinging  under  the  influence  of  gravity  [7], 

d20 

— y  +  sin0=O.  (1.38) 

dtz 


Letting  x(t)  =  6(t)  and  following  (1.37),  we  arrive  at  the  phase  plane  system, 


dx 

dt 


=  y. 


dy_ 

dt 


=  —  sin  v . 


(1.39) 


Applying  (1.32),  the  equilibrium  points  are  given  by  y*  =0  with  v*  =  nn  for 
n  =  0,  ±1,  ±2, _ From  (1.34)  the  Jacobian  matrix  at  any  (x,  y)  is 


3(x,  y) 
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Fig.  1.3  The  phase  plane  for  the  pendulum  (1.39):  (Left)  Sketch  of  linearised  behaviours  in  neigh¬ 
bourhoods  of  the  equilibrium  points  and  (Right)  a  computed  plot  of  the  full  phase  plane 


Evaluating  the  Jacobian  at  the  equilibrium  point  (0,  0)  yields 

A  =  J(0,  0)  =  ^  ^  =>>  A2  +  1  =  0,  A  =  =b/  (a  centre), 

while  the  equilibrium  point  (tv,  0)  yields 


A  =  J(7T,  0) 


>  A2  —  1  =  0,  A  =  ±1  (a  saddle  point), 


with  eigenvectors  (1,  —1)  and  (1,  1).  These  vectors  define  the  asymptotes  of  the 
family  of  hyperbolas  centred  at  this  equilibrium  point,  called  the  stable  manifold  (for 
A  <  0)  and  unstable  manifold  (for  A  >  0),y  =  ±(x  —  n),  see  Fig.  1.3  (left).  Other 
equilibrium  points  at  ([2k  +  1]tt,  0)  also  have  exactly  the  same  form  by  the  periodic 
nature  of  g(v).  Figure  1.3  shows  that  the  full  phase  plane  smoothly  extends  the  local 
behaviours  at  the  equilibria  to  cover  all  possible  solutions.  It  is  notable  that  trajectories 
connecting  the  saddles  (called  heteroclinic  orbits)  separate  small-amplitude  periodic 
oscillations  abound  0  =  0  (the  continuous  family  of  closed  curves)  from  high-speed 
“spinning”  solutions,  where  the  angle  increases  (or  decreases)  monotonically  for  all 
time. 


1.5.1  Nullclines 

Nullclines  are  curves  in  the  vy  plane  that  can  provide  further  understanding  of  sys¬ 
tems  by  dividing  the  phase  plane  into  regions  with  different  behaviours.  The  nullclines 
are  not  solutions  of  the  system,  but  they  do  yield  valuable  insight  on  the  properties 
of  solution  trajectories  that  pass  through  the  nullcline  curves  (or  lie  on  one  side  or 
the  other). 
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The  x-nullcline  is  a  curve  in  the  phase  plane  on  which  dx/dt  =  0.  On  this  nullcline 
x(t)  instantaneously  has  zero  rate  of  change,  hence  v  is  fixed  while  y(t)  changes 
with  time,  so  a  trajectory  passing  through  the  v -nullcline  will  have  a  vertical  tangent. 
The  v -nullcline  is  given  by  the  implicitly  defined  curve  f(x,y)  =  0.  The  v -nullcline 
also  separates  solutions  that  have  x(t)  increasing  from  those  that  have  v  decreasing 
with  time, 


Region  where  f(x,y )  <  0: 
Region  where  f(x,y )  >  0: 


dx 

—  <  0 
dt 

dx 

—  >  0 
dt 


x(t)  decreasing  («— ) 
x(t)  increasing  (— >) 


Analogously,  the  y-nullcline  is  a  curve  on  which  dy /dt  =  0.  Here  y(t)  instanta¬ 
neously  has  zero  rate  of  change  and  a  trajectory  passing  through  the  y-nullcline  will 
have  a  horizontal  tangent.  The  y-nullcline  is  given  by  the  implicitly  defined  curve 
g(x,  y)  =  0.  The  y-nullcline  also  separates  solutions  that  have  y(t)  increasing  from 
those  that  have  y  decreasing  with  time, 


Region  where  g(x,  y)  <  0: 

A 

o 

y(t)  decreasing  (|) 

Region  where  g(x,  y)  >  0: 

o 

A 

y(t)  increasing  (f ) 

Combining  the  information  on  horizontal  and  vertical  components  of  motion  from 
the  v,  y-nullclines  respectively,  yields  very  useful  local  qualitative  information  on 
the  directions  of  trajectories  that  can  be  extended  to  understand  the  structure  of  the 
phase  plane.  In  particular,  note  that  the  intersections  of  the  nullclines  are  equilibrium 
points  and  the  transitions  in  values  of  dx/dt  and  dy/dt  can  help  identify  the  type  of 
equilibrium  point  without  doing  the  linear  stability  analysis. 

As  an  example  consider  the  system 


dx 

dt 


dy_ 

dt 


=  x2  +  y2  —  4. 


(1.40) 


The  first  equation  gives  the  v -nullcline  as  the  parabola  y  =  x2  /\/2.  Above  the 
parabola  x(t)  is  increasing  with  time,  while  below  the  parabola  x(t)  is  monotone 
decreasing.  On  the  parabola,  trajectories  have  vertical  tangents.  The  second  equation 
in  (1.40)  gives  the  y-nullcline  as  the  circle  x2  +  y2  =  4.  Inside  the  circle  y(t)  is 
decreasing,  while  outside  the  circle  y  (t)  is  increasing.  On  the  circle,  trajectories  have 
horizontal  tangents. 

The  intersection  of  the  nullclines  makes  the  locations  of  the  two  equilibrium  points 
immediately  clear.  These  points  could  also  be  obtained  analytically  from  (1.32)  as 
(v* ,  y*)  =  (±  V 2,  V2).  Subsequently,  the  eigenvalues  for  the  linear  stability  analysis 
at  the  equilibrium  points  can  be  obtained  by  solving 
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-2  0  2 

X 


Fig.  1.4  Nullclines  for  system  (1.40):  (Left)  directional  information  determined  from  the  x-  and 
y-nullclines  and  its  application  to  local  structure  near  the  two  equilibrium  points,  (Right)  computed 
trajectories  for  the  system 


1 


2  y*  —  X 


At  (*+,  >’*)  we  get  A.  i  3.356  End  X2  ~  — 2.528  yielding  a  saddle  point.  At 
(x~,  y *)  we  get  an  unstable  spiral  with  X\p  ~  2.414  ±  1.630/.  The  nature  of  these 
equilibria  could  be  inferred  from  information  provided  by  the  nullclines  without  these 
slightly  cumbersome  algebraic  calculations,  see  Fig.  1.4.  The  usefulness  of  nullclines 
becomes  more  significant  in  reducing  more  complicated  systems  and  constructing 
proofs  about  properties  of  families  of  solutions. 


1.6  Further  Directions 

The  analysis  of  the  linear  stability  of  equilibrium  points  can  be  extended  to  dynamical 
systems  in  ^-dimensions  [70].  However,  the  phase  plane  is  special  because  some 
geometric  arguments  do  not  extend  in  a  simple  way  to  curves  in  space  W7  for  n  >  3. 
Consequently  it  is  very  helpful  when  higher-order  systems  can  be  reduced  down  to 
phase  planes. 

The  material  in  this  chapter  gives  only  a  very  brief  review  of  selected  fundamen¬ 
tal  results.  For  more  detailed  background  on  phase  planes  and  further  coverage  of 
bifurcations  and  dynamical  systems  theory,  see,  for  example  [43,  45,  54,  70,  94]. 

Comprehensive  presentations  of  chemical  reaction  systems  are  found  in  most 
chemistry  textbooks,  for  example  [6].  More  concise  introductions  are  given  in  the 
applied  mathematics  books  by  Holmes  [49]  and  Keener  and  Sneyd  [57]. 

The  models  described  here  by  systems  of  ODE  are  the  simplest  type  of  rate  equa¬ 
tions,  sometimes  also  called  state-space  models,  with  the  rates  of  evolution  dependent 
on  the  current  solution  state.  In  delay  differential  equations ,  the  current  rate  depends 
on  the  solution  at  an  earlier  time,  dx/dt  =  f(x(t  —  1))  [35].  Discrete-time  maps , 
also  called  difference  equations  are  analogous  algebraic  equations  describing  evolv- 
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in g  solutions  at  discrete  times,  say  year-to-year,  as  in  xn+\  =  f(xn)  [5].  Stochastic 
differential  equations  have  randomly  varying  contributions  to  the  rate  functions  and 
are  sometimes  used  to  represent  the  variations  among  the  dynamics  of  individuals 
in  a  large  population,  improving  on  (1.2)  which  describes  a  uniform  average  behav¬ 
iour  [36].  More  detailed  models  of  populations,  called  structured  population  models 
divide  up  the  population  by  age  or  size;  if  these  are  treated  as  continuous  variables, 
then  the  rate  equations  yield  partial  differential  equations.  We  will  touch  on  some  of 
these  topics  in  later  chapters.  Haberman’s  book  [45]  also  provides  very  accessible 
introductions  to  several  of  these  types  of  models. 


1.7  Exercises 

1.1  Consider  the  problem  of  tracking  the  vertical  position,  z(t ),  of  a  rocket  whose 
mass  changes  as  it  consumes  its  fuel.  If  the  rocket  starts  from  rest  at  z(0)  =0  with 
initial  mass  m( 0)  =  mo  and  obeys 

dz\  dm 

m(t) —  I  =  —mg  +  rm,  —  =  —m, 

dt )  dt 

solve  the  ODEs  to  determine  z(t)  and  determine  the  condition  on  r  that  is  necessary 
for  lift-off. 

1.2  Use  basic  solution  methods  for  first  order  ODEs  to  solve  the  elementary 
reactions  (1.8-1.13)  for  A(t),  B(t),  C(t )  starting  from  initial  conditions  Ao,  Bo,  Co 
respectively. 

1.3  Write  the  four  rate  equations  for  chemicals  C,  E,  P,  S  governed  by  the  reac¬ 
tions 

k\  k'i 

S  +  E  C  -4  P  +  E. 

k2 


1.4  Write  the  rate  equations  for  X(t),  Y ( t ),  Z(t)  describing  the  reaction  system 

A  B  D  G  H 

(source)  ->  X  Y  ^  X  Y  3  Y  ^  Z  Z  (waste). 

C  E 

1.5  Consider  the  dynamics  of  x(t)  satisfying  the  first  order  ODE, 


dx 

dt 


=  x  —  x4 5  —  k  v(0)  =  vq, 


for  different  values  of  the  parameter  k  (see  Fig.  1.2). 


4  We  delay  solving  more  complicated  systems  of  reactions  to  Chap.  10.  Here  we  only  want  to  set  up 

the  rate  equations. 
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(a)  For  k  =  0,  obtain  the  exact  solutions  starting  from  vo  =  1/2  and  vo  =  —2.  Show 
that  these  solutions  match  the  linearised  results  from  (1.26)  at  the  equilibrium 
points  x*  =  {  —  1,  0,  1}  for  the  long-time  behaviour  ( t  — >  oo),  and  the  “pre¬ 
history”  of  the  solution  ( t  — >  —  oo).  Show  that  for  all  initial  conditions  with 
vo  7^  0,  one  of  two  final  states  are  approached  as  t  — >  oo. 

(b)  Determine  the  two  values  of  k  for  which  the  ODE  has  only  two  distinct  equilib¬ 
rium  points.  Sometimes  called  critical  values  or  bifurcation  points ,  such  parame¬ 
ter  values  determine  special  cases,  where  different  analysis  is  needed  to  describe 
the  system;  in  this  case,  this  will  involve  second-order  degenerate  equilibrium 
points.  Determine  the  coefficient  b  occurring  in  the  equation  for  the  local  be¬ 
haviour  of  solutions  near  the  degenerate  equilibrium  points,  du/dt  =  bu2  with 
u(t)  =  x(t)  —  x *. 

Show  that  for  this  problem,  these  bifurcation  values  separate  ranges  of  k  where 
there  are  global  attractors  (unique  final  states  approached  by  all  initial  conditions 
for  t  — >  oo)  from  cases  where  two  stable  equilibrium  states  co-exist  (called  bi¬ 
stability). 

1.6  Consider  the  equations  for  local  behaviour  at  second-  and  third-order  degenerate 

equilibrium  points,  (1.27)  and  (1.28). 

(a)  Use  separation  of  variables  to  solve  the  ODE  analytically  and  subsequently 
describe  the  stability  of  the  equilibrium  point  at  u  =  0  for  both  choices  of  the 
sign  of  b. 

(b)  Using  only  the  sign  of  the  velocity  on  the  phase  line  consider  solutions  starting 
from  initial  conditions  with  uq  ^  0  to  obtain  the  stability  of  the  equilibrium 
point  without  the  need  for  the  ODE  solutions. 

(c)  Repeat  (a,  b)  for  the  dependence  of  (1.28)  on  c. 

1.7  Consider  the  second  order  ODE  for  x  (t), 

d2x  i  o 


(a)  Let  y  =  dx/dt  and  write  the  ODE  as  a  phase  plane  system.  Determine  the  linear 
stability  properties  of  the  two  equilibrium  points. 

(b)  Show  that  the  solutions  satisfy  the  equation 

\{x)2  -  \x2  +  \x2  =  H, 

where  H  is  a  constant  of  integration,  sometimes  called  the  Hamiltonian . 

(c)  For  a  range  of  values,  0  <  x  <  M,  this  problem  has  a  continuous  set  of  periodic 
solutions.  Show  that  the  maximum  and  minimum  values  of  x(t)  of  each  periodic 
solution  satisfy  a  polynomial  equation  involving  H.  Define  the  amplitude  of  the 
oscillations  as  A  =  vmax  —  vmin.  Show  that  A  =  0  corresponds  to  an  equilibrium 
point.  Show  that  the  largest  amplitude  solution  has  vmin  =  0;  determine  its  value 
for  vmax(=M).  What  is  the  range  of  values  for  HI 
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(d)  Use  t  =  f  dt  =  J  to  show  that  the  period  of  oscillation  for  these  solutions 
is  given  by 

•Tmax 

dx 

■Jx2  —  ^x2  +  2  H 


(e)  Show  that  the  simpler  piecewise-linear  model 


d2x 
dt 2 


=  f(x) 


with 


fix)  = 


2  —  x 


x  <  1 

X  >  1 


has  the  same  equilibrium  points  and  the  same  linear  stability  properties  at  the 
equilibria. 

Solve  the  two  linear  problems 

XA  (0)  =  xmin  x'A  (0)  =  0 

xB(P/2 )  =  xmax  x'B(P/2)  =  0 

and  construct  a  periodic  solution  of  the  piecewise-linear  model  by  enforcing 
smoothness,  x'A  =  x'B  at  x a  =  xb  =  1. 


dZXA 
dt 2 


=  XA 


d2XB 

dt2 


=  2-  xB 


Chapter  2 

Transport  Equations 


Many  systems  exhibit  evolution  over  time  with  properties  of  interest  that  vary 
throughout  their  spatial  domains.  Examples  of  such  systems  arise  in  population 
dynamics ,  which  describes  the  distribution  of  individuals  in  some  population  and 
how  they  interact.  “Individuals”  could  refer  to  molecules,  electrons,  particles,  ani¬ 
mals,  people,  company  stocks,  or  network  messages. 

One  way  to  study  the  overall  population  is  to  attempt  to  track  each  individual,  such 
approaches  are  sometimes  called  individual-based  models .  The  tracking  process  is 
usually  very  labour  intensive  and  involves  collecting  a  lot  of  data  on  the  actions  of 
all  individuals.  If  this  level  of  detail  is  not  crucial  and  a  more  ‘large-scale’  view  is  of 
interest,  then  continuum  theory  may  be  a  better  option. 

Continuum  theories  yield  evolution  equations  with  respect  to  properties  that  are 
averaged  over  small  intervals  of  time  and  small  regions  of  space.  In  such  cases,  we 
implicitly  assume  a  continuum  hypothesis  which  states  that  appropriately  averaged 
behaviours  of  individuals  can  be  generally  predicted  from  trends  in  the  local  popula¬ 
tion.  We  consequently  formulate  continuum  models  as  partial  differential  equations 
(PDE)  governing  the  evolution  of  density  functions  (f(x,t))  describing  properties  of 
the  population  at  a  given  position  and  time.  Integrating  the  density  over  the  entire 
domain  can  be  used  to  capture  the  time-dependence  of  the  property  on  the  whole 
population,  ■r(t)  =  J f  (x,  t)dx. 

Some  specific  applications  of  continuum  models  include: 

•  Fluid  dynamics:  the  flow  of  liquids  and  gases  (density  of  molecules  in  space) 

•  Solid  mechanics:  the  deformation  of  solids  (density  of  molecules  in  space) 

•  Electromagnetics:  the  flow  of  electric  currents  in  materials  (density  of  electric 
charges  in  space) 

•  Scattering  theory:  dynamics  due  to  collisions  in  high  energy  particle  physics  (den¬ 
sity  of  particles  having  different  velocities) 
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•  Age- structured  population  dynamics:  birth,  death,  and  aging  of  a  population 
(distribution  of  individuals  having  different  ages)  [28,  74] 

•  Size-structured  population  dynamics:  growth  and  decay  of  physical  sizes  of  indi¬ 
viduals  in  a  population  (distribution  of  individuals  having  different  sizes)  [8] 

Fluid  dynamics  and  solid  mechanics  are  often  grouped  together  under  the  heading 
of  continuum  mechanics.  Other  population  models  focus  not  on  change  in  position, 
but  on  properties  like  stock  price  in  financial  models,  popularity  in  social  networks 
or  genetic  traits  in  biological  systems. 

In  each  of  these  contexts,  PDEs  can  be  used  to  describe  the  redistribution  of  the 
property  of  interest  over  time.  Conveying  shifts  or  “motion”  in  the  density,  whether 
with  respect  to  spatial  position  or  with  the  independent  variable  x  representing  other 
properties  (e.g.  velocity,  age,  size),  such  PDEs  are  also  broadly  called  transport 
equations.  The  common  structure  shared  by  these  models  is  having  a  PDE  for  the 
rate  of  change  of  the  density  involving  the  spatial  gradient  of  a  flux  function,  which 
characterises  transport  of  the  property  within  the  population.  Transport  equations 
are  fundamental  for  describing  problems  in  many  fields  extending  from  theoretical 
physics,  chemical  engineering,  and  mathematical  biology  to  probability  theory. 

In  this  chapter  we  introduce  the  fundamental  approach  for  formulating  transport 
models  (conservation  laws  and  the  Reynolds  transport  theorem).  We  then  go  on  to 
describe  the  method  of  characteristics,  a  methodology  for  constructing  exact  solu¬ 
tions  to  basic  transport  models. 


2.1  The  Reynolds  Transport  Theorem 

As  a  conceptual  starting  point,  we  consider  how  we  might  describe  the  dynamics 
of  individuals  in  a  large  population — describing  their  motion  (change  of  absolute 
position  as  a  function  of  time,  X(f))  and  deformation  (rearrangement  or  change  of 
relative  position  within  groups  of  surrounding  individuals). 

In  order  to  introduce  the  basic  principles,  we  begin  by  studying  the  case  of  passive 
transport  of  inert  particles  carried  by  an  externally  imposed  flow  field,  for  example, 
particles  of  dust  cloud  in  the  wind  or  a  pollutant  carried  in  a  running  river.  Here 
“passive”  indicates  that  the  presence  of  the  particles  does  not  influence  the  flow 
driving  their  motion. 

Each  particle  can  be  uniquely  identified  by  its  initial  position  at  time  t  =  0, 

X(t  =  0)  =  A  =  (A,  B,  C)  (  X(t  =  0)  =  A  in  ID  )  (2.1a) 

Assume  an  imposed  velocity  field,  v(x,  t),  is  given  that  specifies  the  speed  and  direc¬ 
tion  that  a  particle  occupying  position  x  would  take  at  time  t.  Having  an  explicit 
expression  for  properties  in  a  fixed  coordinate  system  is  called  an  Eulerian  descrip¬ 
tion.  From  the  definition  of  velocity  as  the  rate  of  change  of  position,  the  motion  of 
a  particle  is  given  by 
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dX  (  dX 

—  =  v(X,  t).  (  —  =  v(X,  t)  in  ID 

dt  \  dt 

This  initial  value  problem  gives  the  motion  of  a  particle  over  time,  and  is  sometimes 
called  the  problem  for  the  particle’s  pathline . 

An  alternative  point  of  view  is  to  recognise  that  for  a  particle  starting  from  position 
(2.1a),  the  Eulerian  description  of  the  entire  velocity  field  is  not  needed.  All  that  is 
essential  is  the  velocity  along  the  particle’s  pathline.  The  Lagrangian  description 
gives  a  property  following  the  motion  of  a  given  particle  as  a  function  of  time.  The 
Lagrangian  velocity  for  the  particle  starting  from  position  A  is 

\(t;  A)  =  v(X(f;  A),f).  (2.2) 

The  problem  for  the  pathline  can  then  be  restated  in  Lagrangian  form  as 

-j-  =  V(L  A),  X(0;  A)  =  A.  (2.3) 

dt 

We  will  show  that  the  ability  to  change  between  these  two  equivalent  forms  will 
enable  us  to  solve  transport  equations. 

Values  of  other  properties  carried  by  point  particles  (e.g.  density,  temperature, 
radioactivity)  can  similarly  be  expressed  in  both  Eulerian  f(x,t )  and  Lagrangian 
F(t)  forms  via 

F(f;A)=/(X(f;A),f).  (2.4) 

In  describing  the  evolution  of  any  property  /  following  a  particular  particle,  it  is 

necessary  to  calculate  the  rate  of  change  of/ for  the  given  particle/part  (0  =  F(t;  A)  = 

f(X(t;  A),  t).  Using  the  chain  rule,  we  can  express  this  “Lagrangian  time  derivative” 
in  terms  of  Eulerian  functions, 


) 


(2.1b) 


dt 


9f 

3 t 


+  v  •  V/  = 


Df_ 

Dt 


(2.5) 


where  we  have  used  (2.1b).  The  Eulerian  form  of  this  derivative  is  called  the  con¬ 
vective  (or  material )  derivative  and  is  denoted  We  will  see  that  the  velocity  field 
defining  the  flow  has  a  special  role  in  transport  equations.  If  v  is  given,  then  the 
problem  for  the  evolution  of  the  property  of  interest  is  called  a  kinematics  problem 
(as  in  passive  transport).  If  the  evolution  of  v  is  coupled  to/  and  must  be  determined 
as  part  of  the  solution,  then  it  is  a  more  challenging  dynamics  problem . 

The  next  stage  in  formulating  a  continuum  model  is  to  determine  the  rate  of 
change  of  a  property  evaluated  over  a  u material  blob  ” — in  other  words,  we  consider 
a  specific  set  of  particles,  defined  as  starting  from  a  set  of  A  values  (2.1a)  occupying 
a  region  D  in  space.  Lor  example,  picture  the  blob  as  the  fluid  in  a  small  droplet.  The 
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cumulative  value  of  property  /  over  the  material  blob  is  given  by 


fblobiO  — 


/(x,  t)dV, 


D(t) 


(2.6) 


where  D(t )  is  the  region  occupied  by  the  moving,  deforming  blob  at  time  t.  We  would 
like  to  know  how/biob(0  varies  with  time.  In  one  dimension,  a  “blob”  is  simply  a 
time-dependent  interval,  a(t )  <  x  <  bit)  and  (2.6)  reduces  to 


fb(t) 


/biob(0  =  /  / (x,  t)  dx.  (2.7) 

J  a(t ) 

To  find  the  rate  of  change,  we  apply  Leibniz’s  rule  to  determine  the  derivative  of  an 
integral  with  time-dependent  endpoints, 


d 

dt 


bit) 


f(x,  t)  dx  )  = 


a{t) 


■b 


a 


df  db  da 

~  dx  +/ (b,  t)—  —f(a,  t)  — 
at  dt  dt 


fix,  t) 


dx 


i  x=b 


dt  I  x=a 


>b  r 


a  L 

>b  r 


df  d  (  dx  . 

+  —{fix,  t)—  1  I  dx 


dt  dx 


dt 


)] 


3/  3 

/  +  —(Tv)  dx 
dt  dx 


(2.8) 


where  again,  we  have  made  use  of  (the  one-dimensional  version)  of  (2.1b).  In  two 
and  three  dimensions,  making  use  of  the  divergence  theorem,  we  can  generalise  this 
result  to  give  the  Reynolds  Transport  Theorem  [1], 


d  , 
~dt  ' 

S 

II 

S 

r  9/  i 

Lar+V'H 

dV 

(2.9) 


2.2  Deriving  Conservation  Laws 

In  order  to  apply  the  Reynolds  transport  theorem  to  obtain  a  transport  model,  we 
need  the  introduction  of  a  conservation  principle.  This  is  a  statement  providing 
information  about  the  rate  of  change  of  /bi0b  that  applies  to  all  possible  material 
blobs. 

For  example,  requiring  conservation  of  mass — the  principle  that  total  mass  can  be 
neither  created  nor  destroyed,  the  mass  of  every  blob  must  remain  constant  in  time. 
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Expressing  the  mass  in  terms  of  the  material  density,/  =  p,  in  one-dimension  this 
gives 

[b{t)  dmwob 

mbiob(0  =  /  p(x,t)dx,  - =  0  V(a,  b).  (2.10) 

Ja{t )  dt 

In  order  to  change  from  a  statement  about  properties  of  blobs  to  a  transport  PDE,  the 
final  tool  involved  is  an  integral  result  from  analysis,  sometimes  called  the  du  Bois- 
Reymond  lemma  [49],  which  we  state  in  its  simplest  form  on  the  one-dimensional 
domain  0  <  v  <  1  as 


If  f  g(v)  dx  =  0  0  < 

J  a 

In  other  words,  if  an  integral  vanishes  for  all  choices  of  sub-domains,  then  the  inte¬ 
grand  must  vanish  on  the  whole  domain.1  This  result  allows  us  to  convert  from  an 
integral  equation  (called  a  weak  form,  applying  on  the  domain  as  a  whole)  to  a  dif¬ 
ferential  equation  (called  a  strong  form  that  applies  locally,  pointwise  at  each  x  in 
the  whole  domain),  if  the  integral  is  valid  for  all  blobs. 

Applying  the  Reynolds  Transport  theorem  to  (2.10),  we  reduce  the  principle  of 
conservation  of  mass  to  a  PDE  for  the  density,  yielding  the  (local)  conservation  law 
for  mass  density  in  one  dimension, 


V(a,  b)  <1,  then  g(x)  =  0  for  0  <  x  <  1  .  (2.11) 


dp 

~dt 


+ 


—  (pv)  =  0, 
dx 


(2.12) 


also  known  as  the  continuity  equation.  In  three  dimensions,  the  corresponding  result  is 


^\IIIDi)P(X’t)dV)=0  VD  ^  +  V  •  (pv)  =  0.  (2.13) 

The  quantity  in  the  parenthesis  is  called  the  flux,  q,  and  corresponds  physically  to 
the  rate  of  p  passing  through  a  fixed  point  per  unit  time  (here  q  =  py). 

If  we  are  given  information  on  the  rate  at  which  the  property  of  interest  is  created 
or  destroyed,  say  due  to  a  chemical  reaction  as  in  dp/dt  =  R  like  (1.8),  then  the  rate 
of  change  of  the  total  amount  of  the  chemical  in  domain  D  be  can  expressed  by 


j,{IILpdV)=IILRdV-  (Z14> 

This  should  be  true  in  any  subdomain  (any  material  blob).  If  transport  is  present  then 
D  =  D(t )  and  after  applying  the  Reynolds  Transport  Theorem  to  the  left  integral, 
we  can  regroup  the  resulting  integrals  together  as 


1  Assuming  the  integrand  to  be  a  smooth  function. 
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+  V  •  (pv)  -  R 


dV  =  0 


(2.15) 


Then  applying  the  du  Bois-Reymond  lemma  yields  the  general  conservation  law 
including  reaction  terms  (sometimes  called  source  or  sink  terms  depending  on 
whether  the  rate  of  production  is  positive  or  negative), 


dp 

-£  +  V  •  (pv)  =  R. 
ot 


(2.16) 


Equation  (2.14)  can  be  applied  to  Newton’s  second  law,  stating  that  the  rate  of 
change  of  momentum  is  equal  to  the  sum  of  the  applied  forces 


j,(IILrdV)=IIL,dV'  <2-17) 

where  f  gives  net  forces  per  unit  volume.  Applying  the  Reynolds  transport  theorem, 
the  du  Bois  Reymond  lemma,  and  expressing  the  forces  in  terms  of  the  divergence 
of  a  stress  tensor,  f  =  V  •  a,  yields  the  Cauchy  momentum  equation , 

3(pv) 

— —  +  V  •  (pvv)  =  V  •  or.  (2.18) 

dt 

Encapsulating  the  principles  of  conservation  of  mass  and  momentum,  equations 
(2.13)  and  (2.18)  are  the  basis  of  continuum  mechanics  [41,  50].  Supplemented  by 
appropriate  equations  for  defining  the  stress  in  terms  of  p  and  v,  called  constitu¬ 
tive  relations  or  equations  of  state ,  these  yield  the  governing  equations  for  solid 
mechanics  and  the  Navier-Stokes  equations  for  fluid  dynamics. 


2.3  The  Linear  Advection  Equation 


In  this  section  we  will  focus  on  the  most  fundamental  case  of  (2.16),  where  R  =  0 

/V 

(no  reactions)  and  the  velocity  is  a  uniform  constant  vector,  v  =  ci  (constant  speed), 
yielding  the  one-dimensional  advection  equation 


dp 

~dt 


+  c 


dp 

dx 


(2.19) 


All  solutions  of  the  advection  equation  can  be  written  as  constant  profile  travelling 
waves , 


p(x,  t)  =  P(x  —  ct), 


(2.20) 


where  the  terminology  “travelling  wave”  refers  to  a  solution  moving  with  a  fixed 
velocity  (speed  in  one  dimension)  and  with  a  “constant  profile”  meaning  that  the 
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Fig.  2.1  A  travelling  wave 

solution  (2.20), 

p(x,  t )  =  P(x  —  ct ),  of  (2.19) 

for  c  >  0  at  three  successive 

times  ^2 

it 

cl 


wave  shape  P(x)  given  at  t  =  0  is  maintained  for  all  times.  If  c  >  0,  the  wave  profile 
propagates  to  the  right  (and  to  the  left  if  c  <  0)  (see  Fig.  2.1). 

While  (2.20)  is  a  solution  of  equation  (2.19)  for  any  function  P,  for  the  purpose 
of  considering  the  complexities  associated  with  more  general  transport  models  (with 
other  forms  of  flux),  it  is  helpful  to  first  understand  the  action  of  the  equation  on 
simple  functions.  If  we  can  express  P(x)  as  a  complex  Fourier  series  (see  Appendix 
A),  P(x)  =  Akelkx,  then  the  travelling  wave  (2.20)  can  be  written  as 


oo 

p(x,t)=  £  Akeik^ 
k=—o o 


^  Akj[kx-Mt] 


(2.21) 


where  k  is  called  the  wave  number  (related  to  the  wavelength  between  successive 
crests  of  cos  (be)  in  space,  L  =  2n  Ik)  and  go  is  called  the  angular  frequency  (related 
to  the  period  of  oscillation  of  sin  (cot)  in  time,  T  =  In  I  go).  For  (2.19),  the  expansion 
(2.21)  defines  go  =  ck ,  but  the  Fourier  decomposition  is  applicable  to  other  equations 
as  long  as  a  relationship  can  be  found  between  the  frequency  and  the  wavenumber, 
go  =  Go(k ),  called  the  dispersion  relation. 

Consider  the  fifth  order  linear  homogeneous  constant-coefficient  PDE 


dp  dp 

—  +  c\  — 
dt  dx 


(2.22) 


If  we  substitute  in  a  Fourier  mode  pk(x ,  t)  =  A &  exp(/[bf  —  Got])  (called  a  uniform 
plane  wave)  as  a  trial  solution,  (2.22)  reduces  to  a  relationship  between  go  and  k 
given  by 

—ioo  +  c\(ik)  +  C3  (ik)^  +  cs(ik)5  =  0, 


which  determines  the  dispersion  relation,  Go(k)  =  c\k  —  C3&3  +  c5k5. 

Descriptions  of  wave  properties  in  terms  of  the  dispersion  relation  are  fundamen¬ 
tal  in  many  models  that  arise  as  generalisations  of  the  advection  equation.  The  phase 
speed ,  cp  ( k )  =  go  ( k)/k ,  gives  the  speed  of  waves  with  wavenumber  k.  Equation  (2.19) 
has  constant  phase  speed  (being  independent  of  k ),  and  so  is  called  dispersionless 
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because  all  of  its  wave  modes  travel  at  the  same  speed,  maintaining  the  steady  profile 
P(x  —  ct)  (2.21).  In  contrast,  for  dispersive  equations  with  cp(k )  ^  constant,  the 
profile  given  by  the  sum  p(x ,  t)  =  ^kAjielk^x~cP(^)t^  changes  with  time  as  the  com¬ 
ponent  waves  separate  (“disperse”)  according  to  their  different  speeds.  The  growth 
or  decay  of  waves  is  described  by  dispersion  relations  having  imaginary  components: 
wave  amplitudes  are  maintained  if  co  ( k )  is  purely  real,  while  if  co  ( k )  has  an  imaginary 
part  then  dissipation  occurs.  In  order  to  observe  this  effect,  consider  what  happens 
when  we  substitute  the  plane  wave  solution  into  the  diffusion  equation  pt  =  pxx ;  the 
dispersion  relation  is  found  to  be  co  =  —  ik2,  so  that  the  (real- valued)  phase  speed 

is  cp  =  0  and  the  wave  will  dissipate  without  propagating:  Pk(x,  t)  =  (Ake~k  f)elkx 
(these  correspond  to  so-called  ‘standing  waves’). 

We  now  turn  our  attention  to  describing  methods  for  obtaining  the  solutions  to 
various  generalisations  of  the  advection  equation  (2.19). 


2.4  Systems  of  Linear  Advection  Equations 


Consider  a  system  of  two  coupled  linear  advection  equations  for  properties  p(x,t ), 
q(x,  t)  on  —oo  <  v  <  oo, 

Pt  +  aPx  +  bqx  =  0  (2.23a) 

qt  +  cpx  +dqx  =0  (2.23b) 


where  a,  b,  c,  d  are  constants,  and  the  initial  conditions  are  given  by 

p(x,  0)  =f(x)  q(x,  0)  =  g(v).  (2.23c) 

The  typical  approach  for  determining  the  solutions  of  (2.23)  is  to  decouple  the  system 
into  independent  advection  equations,  each  with  a  travelling  wave  solution  of  the  form 

dw  dw 

- 1-  X — =0  =>  w(x,  t)  =  W(x  —  Xt),  (2.24) 

dt  dx 

where  A  is  the  wave  speed  for  some  appropriate  travelling  wave  profile  W. 

We  start  by  assuming  a  solution  as  a  linear  combination, 

w(x,  t)  =  Ap(x ,  t)  +  Bq(x ,  t),  (2.25) 

for  some  constants  A,  B  to  be  determined.  Forming  the  linear  combination  of 
A-  (2.23a)  +5-  (2.23b)  yields 

( Ap  +  Bq)f  +  (aA  +  cB)px  +  ( bA  +  dB)qx  =  0, 
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while  substituting  w  from  (2.25)  into  (2.24)  gives 


C Ap  +  Bq)t  +  (XA)px  +  ( XB)qx  —  0. 


Comparing  coefficients  in  these  equations  leads  to  a  matrix  eigenvalue  problem, 

(^)(b)=4«)-  (226> 

Solving  this  system  yields  the  eigenvalues  (A)  defining  the  wave  speeds  and  the 
components  of  the  eigenvectors  defining  the  relationship  of  w  to  the  solutions  p  and 
q,  (2.25).  In  order  for  the  eigenvalues  to  be  meaningful  as  speeds,  they  must  be  real 
values.  The  solutions  of  the  eigenvalue  problem,  X\  and  A\,B\,  and  A2  and  A2,  B2 , 
determine  the  travelling  wave  solutions 


wi(x,  t)  =A\p(x ,  t)  +Biq(x,  t)  =  W\(x-  X\t), 

W2(x,  t)  =  A2p{x ,  t)  +  B2q(x,  t)  =  W2(x  —  A20, 

where  the  travelling  wave  profiles  Wi(v),  W2(x)  are  otherwise  undetermined  so  far. 
Applying  initial  conditions  (2.23c)  to  evaluate  these  equations  at  time  t  =  0  explicitly 
defines  the  wave  profiles  as  satisfying 


M fix)  +B\g(x)  =  W  i(v), 
Az f{x)  -\-B2g(x)  =  W2(x). 


(2.28) 


With  Wk  now  known  in  terms  of/,  g  and  the  A^ ,  Bk  coefficients  given  by  the  eigenvec¬ 
tors,  the  right-hand  side  of  (2.27)  is  known,  yielding  a  linear  system  for p(x,  t) ,  q(x ,  t) 
in  terms  of  combinations  of  of  fix  —  X^t)  and  g(x  —  X^t)  (with  k  =  1,2). 

This  approach  extends  to  systems  of  any  number  of  coupled  linear  equations, 


dp 

~dt 


(2.29) 


where  p  =  (p,  q,  r,  . .  .)T  e  M/7  and  M  is  an  n  x  n  constant  coefficient  matrix. 
Assuming  trial  solutions  of  the  form  w  =  u  •  p  where  u  =  (A,  B ,  C .)T  and  noting 
the  relationship  between  (2.26)  and  the  coefficients  in  (2.23),  we  see  that  the  general 
eigenvalue  problem  can  be  stated  as 

Mtu  =  Xu.  (2.30) 


If  all  the  eigenvalues  of  M7'  are  real  and  there  is  a  complete  set  of  eigenvectors,  then 
(2.29)  is  called  a  hyperbolic  system  and  its  solutions  can  be  expressed  in  terms  of 
sums  of  travelling  waves  (see  [65,  80,  81,  106]  for  more  details). 
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2.5  The  Method  of  Characteristics 


Semilinear  wave  equations  take  the  form 


dp 

~dt 


+  c(x,  t) 


dp 

dx 


r(x,  t,p). 


(2.31a) 


where  the  speed  c  depends  on  both  position  and  time,  but  not  on  the  solution  itself, 
and  the  reaction  rate  can  depend  on  all  three,  but  not  on  derivatives  of  the  solution. 
The  terminology  ‘semilinear’  arises  because  the  left-hand  side  of  the  equation  is 
linear  with  respect  to  the  dependent  variable  p ,  while  the  right-hand  side  can  be 
nonlinear  in  p.  Problems  for  this  class  of  equations  with  initial  conditions 


p(x.  0)  =  f(x). 


(2.31b) 


can  be  solved  by  converting  from  the  original  Eulerian  PDE  form  into  a  Lagrangian 
form  that  can  be  solved  as  a  set  of  coupled  ODEs. 

In  order  to  see  how  (2.3  la)  defines  a  property  evolving  in  a  flow,  let  the  Lagrangian 
form  of  the  solution  for  each  initial  material  coordinate  A  be  written  as 


Pit ;  A)  =  p(X(t ;  A),  t)  X(0;  A)  =  A.  (2.32) 


Using  the  chain  rule,  the  rate  of  change  of  P  is 

dP  dp  dp  dX 

_ =  JL  +  J- _  (2  33) 

dt  dt  dx  dt 

and  this  can  be  matched  term  by  term  with  the  PDE  (2.3 1  a)  if  the  Lagrangian  variables 
evolve  according  to  the  characteristic  equations 


dP  dX 

—  =  r(X(t),  t,  P(t)),  —  =  ciXit),  t) 

dt  dt 


The  initial  conditions  (2.31b)  take  the  corresponding  form 


(2.34a) 


P(0;  A)  =/(A),  X(0;  A)  =  A.  (2.34b) 


The  solution  of  the  X-equation  subject  to  its  initial  condition  therefore  defines  a 
path  in  the  (v,  t)  plane  known  as  a  characteristic  curve  (or  simply  a  ‘characteristic’), 
and  generalises  the  concept  of  a  pathline.  Once  X(t)  has  been  determined,  it  can  be 
substituted  into  the  evolution  equation  for  P  (2.34a) i  and  can  then  be  solved  for  P(t) . 

Thus,  the  so-called  method  of  characteristics  replaces  the  Eulerian  PDE  with  an 
initial  value  problem  for  a  pair  of  ODEs.  The  Lagrangian  solutions  (X(t;  A),  Pf\  A)) 
are  parameterised  by  the  initial  material  coordinate  A  and  can  sometimes  be  inverted 
to  give  the  explicit  Eulerian  solution  pix,  t). 
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As  an  example,  consider  the  transport  problem  for  p(x ,  t), 


dp  dp  9 

- 1-  2x —  =  xp  ,  p(x,  0)  =  2  +  sin(v). 

dt  dx 


(2.35) 


Following  the  above  analysis,  the  corresponding  characteristic  problem  is  given  by 


dX 

dt 

dP 

dt 


=  2X, 


XP2, 


X(0)  =A, 


P(  0)  =  2  +  sin  (A), 


(2.36a) 


(2.36b) 


Solving  (2.36a)  yields  X(t;  A)  =  Ae2t ,  which  we  can  then  substitute  into  the  ODE 
for  P  (2.36b)  to  obtain  the  general  solution 


dP 

dt 


=  Ae2tP2  =>•  Pit)  = 


1 


C  -  \Ae2t 


Imposing  the  initial  condition  on  P  at  t  =  0  (when  X  =  A)  then  yields  that 


P(t;A)  =  (-  (l  -e2t)  + - 1 - )  , 

\2  V  /  2  +  sin(A) / 

(see  Fig.  2.2).  Inverting  Ae2t  =  X  =  x  gives  A  =  xe~2t .  Substituting  this  into  P(t\  A) 
and  applying  P  =  p  results  in  the  final  form  of  the  solution 


Fig.  2.2  (Left)  Characteristic  curves  in  the  (jc,  t)  plane  for  problem  (2.35).  (Right)  Function  values 
evolving  on  a  characteristic  curve 
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2.6  Shocks  in  Quasilinear  Equations 


Quasilinear  equations  differ  from  semilinear  equations  only  in  having  the  speed  c 
depend  additionally  on  the  solution, 


dp  dp 

—  +  c(x,  t,p)—=  r(x,t,p ) 


This  makes  the  characteristic  ODEs  fully  coupled, 


(2.37) 


dP  dX 

—  =  r(X,  t,  P),  —  =  c(X,  t,  P).  (2.38) 

dt  dt 

For  semilinear  equations,  the  equation  (2.34a)  for  the  characteristic  curves  v  = 
X(t\  A)  decouples  from  P,  and  standard  existence  and  uniqueness  results  guarantee 
that  two  curves  starting  from  different  initial  positions  (and  having  different  values 
of  P(t))  will  never  intersect.  This  is  not  the  case  for  the  coupled  equations  in  (2.38), 
where  characteristics  can  cross  and  hence  predict  several  different  P(t\  Aj)  values 
occurring  simultaneously  at  the  same  v  position. 

As  an  example,  consider  the  inviscid  Burgers  equation , 


dp 

~dt 


(2.39) 


with  initial  conditions 


p(x ,  0)  =  « 


l-M 

o 


M  <  l, 

otherwise. 


(2.40) 


Equation  (2.39)  is  an  important  mathematical  model  that  will  arise  again  later  in 
other  contexts;  one  aspect  of  its  importance  can  be  observed  by  noting  that  if  the 
property  p  is  the  flow  velocity,  then  (2.39)  is  the  convective  derivative  of  the  velocity 
(2.5)  (the  Lagrangian  form  of  the  acceleration),  ^  =  vt  +  vvx  =  0. 

The  characteristic  equations  for  this  example  are 


dX 

dt 


dP 

dt 


X(0  ;A)=A, 


(2.41a) 


P(0;A) 


1-|A|  \A\  <  1, 
0  else. 


(2.41b) 


We  note  that  for  this  problem  P  remains  constant  along  each  characteristic,  so  that 
X  =  Pt  +  A  and  the  solution  can  be  expressed  as 
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X  =  A,  P 

X=(l+A)f  +  A,  P 

X  =  (l  -A)t  +  A,  P 

X  =  A,  P 


0  A  <  — 1, 

1  +  A  -  1  <  A  <  0, 
l- A  0  <  A  <  1, 

0  A  >  1 


and  restated  in  an  explicit  form  for  t  >  0 


p(x,  t) 


0 

(1  +x)/(l  +  0 
(1  -x)/(l  -t) 
0 


x  <  —1, 

—  1  <  X  <  t, 
t  <  X  <  1, 

X  >  1. 


(2.42) 


(2.43) 


There  are  several  points  to  note  about  this  representation  of  the  solution — first,  the 
piecewise-defined  solutions  are  not  mutually  exclusive  for  t  >  1.  Consequently 
multiple  values  are  being  predicted  for  p  at  some  locations  when  t  >  1 .  In  relation 
to  this,  the  third  subcase  becomes  undefined  at  time  t  =  1,  changing  from  negative 
slopes  for  t  <  1  to  positive  slopes  for  t  >  1 .  Figure  2.3  shows  the  characteristic  curves 
and  p(x ,  t)  profiles  given  by  (2.43).  We  observe  that  the  portion  of  the  solution  starting 
from  v  e  [—1,0]  spreads  out  over  an  increasingly  large  region  as  its  characteristics 
separate  from  each  other  (this  is  sometimes  called  an  expansion  fan  or  rarefaction 
wave). 

In  contrast,  the  portion  of  the  solution  starting  from  v  e  [0,  1]  is  being  compressed 
into  a  smaller  region  (up  until  t  =  1)  and  is  referred  to  as  a  compressive  wave.  Its  slope 
steepens  and  overturns  for  t  >  1  to  yield  what  could  be  described  as  a  multi-valued 
“breaking  wave” . 

Mathematically  this  part  of  the  solution  is  predicting  three  values  for  a  physical 
quantity  (maybe  a  density,  concentration  or  temperature,  for  example)  that  should 
have  a  unique  value  at  any  point  v  at  a  given  time  t. 


x  x 

Fig.  2.3  (Left)  characteristic  curves  X(t;  A)  given  by  (2.42)  in  the  xt  plane  and  a  3D  view  of  the 
evolving  multi-valued  solution  p(x,  t)  (2.43)  (Right) 
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x  x 

Fig.  2.4  {Left)  The  multivalued  solution  (2.43)  and  insertion  of  a  shock  at  v  =  xs(t),  (Right) 
reduction  to  a  single- valued  solution  given  by  (2.46) 


There  is  a  systematic  and  rigorous  approach  for  correcting  this  unphysical  behav¬ 
iour  and  modifying  the  solution  to  make  it  single- valued  through  the  use  of  shocks. 
Shocks  are  moving  jump  discontinuities  in  the  solution  that  separate  one  piecewise- 
defined  portion  of  the  solution  ahead  of  the  shock,  p+(x,  t),  from  another  part  behind 
the  shock,  p-(x ,  t ),  eliminating  the  multi-valued  behaviour  (see  Fig. 2.4).  Shock 
waves  are  common  in  many  physical  systems,  including  acoustics  (sonic  booms), 
fluid  flow  (hydraulic  jumps),  and  traffic  flow  (moving  traffic  jams). 

The  construction  of  shock-corrected  solutions  builds  on  the  idea  that  if  a  shock  is 
inserted  at  one  position,  v  =  xs(t ),  it  can  appropriately  separate  the  overlapping  sets 
of  characteristic  curves  to  produce  a  well-defined  single-valued  solution  everywhere 
away  from  the  shock.  The  constructed  solution  should  satisfy  all  of  the  properties 
expected  for  the  conservation  law  and  we  use  this  to  derive  the  equation  for  the 
motion  of  the  shock  position. 

Consider  a  general  quasilinear  equation  describing  the  transport  of  a  property 
p(x,  t)  according  to  a  flux  function  q  =  q(p). 


dp  dq(p) 

dt  dx 


(2.44) 


which  has  been  derived  for  smooth  solutions  via  the  Reynolds  transport  theorem  from 
a  conservation  law  for  p.  The  integrated  form  of  (2.44)  on  a  fixed  domain  a  <  x  <  b 
includes  contributions  from  fluxes  at  the  ends  of  the  domain, 


d 

dt 


x=b 


+  q(p ) 


x—a 


(2.45) 


If  a  shock  were  inserted  at  some  position  x  =  xs(t),  the  piecewise-defined  form  of 
the  solution  becomes 
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p(x ,  t)  =  « 


p_(x,  0 


a  <  x  <  xs(t), 
xs(t)  <  x  <  b. 


(2.46) 


Separating  (2.45)  with  respect  to  dependence  on  the  solution  to  the  left  or  right  of 
the  shock  then  yields 


"  d  (  fXs  \ 

'dtp  \ 

—  \j  p-dxj-  q(p-(a,  0) 

+ 

—  1  J  p+dx  1  +  q(p+(b,  0) 

=  0. 
(2.47) 


Applying  Leibniz’s  rule  (2.8)  then  gives 


Xx 


dp-  dxs 

dx+p-(xs,t)  —  |  + 


b  dp+  dxs 

.  .  - ax  —  p+(xs,t)  — 

dt  x  dt  dt 

Xs 

+  [q(p+(b,  0)  -  q(p-(a,  0)]  =  0. 


) 


By  adding  and  subtracting  q(p±)(xs)  and  re-grouping  terms,  we  obtain 


Xx 


dtp-  dx  +  q(p-(xs ,  0)  “  q(P-(a,  0) 


) 


b 


+  1  /  3 tP+dx  +  q(p+(b,t))  -  q(p+(xs,t)) 


-q(p-(xs,  0)  +  q(p+(xs,  0)  +  [p-(xs,  t)  -p+  (xs,  t )] 


dxs 

dt 


The  terms  in  parentheses  on  the  first  two  lines  vanish  based  on  applying  (2.45)  to  the 
smooth  solutions  on  the  sub-intervals  a  <  x  <  xs  and  xs  <  x  <  b  respectively.  The 
remaining  terms  give  the  so-called  Rankine-Hugoniot  shock  speed  relation 


dxs  q(p+(xs,  Q)  -  q(p-(xs ,  t)) 
dt  p+{xs ,  t)  -p-(xs,  t) 


(2.48) 


For  the  inviscid  Burgers  equation  (2.39),  the  flux  is  q(p)  =  ^ p 2,  and  for  our  specific 
example,  p-(x,  t)  =  (1  +  x)/(l  +  t)  and  p+(x,  t)  =  0  yielding  the  shock  speed 
equation, 


dxs  p+(xs ,  t)  -p-(xs,  t)  dxs  _  1  + 

dt  2  dt  2(1  + 1) 


(2.49) 


Initial  conditions  for  this  equation  are  determined  by  the  time  and  position  where 
characteristics  first  cross,  necessitating  the  insertion  of  a  shock;  in  this  case, 
x5(l)  =  1.  Consequently  the  position  of  the  shock  is  given  by 


xs(t )  =  ^2(1  +  t)  —  1  for  t  >  1. 
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Fig.  2.5  Characteristic 
curves  in  the  xt  truncated  by 
the  shock  x  =  x5  ( t ) 
corresponding  to  (2.46) 


X 


Figure  2.5  shows  the  shock  in  the  xt  plane  in  its  role  in  separating  families  of 
characteristics  that  would  otherwise  intersect.  Excluding  the  shock  curve,  Fig.  2.5 
has  a  single  characteristic  curve  passing  through  each  (x,  t)  point  and  hence  describes 
a  single- valued  solution  of  the  transport  problem.  This  figure  differs  from  Fig.  2.4 
(left)  only  in  the  wedge-shaped  region  bounded  by  the  curves  x  =  l  and  x  =  t  that 
form  the  boundaries  of  what  is  sometimes  called  the  shock  envelope. 

Returning  to  Fig.  2.3,  we  note  that  our  solution  p(x,  t)  began  as  an  equilateral 
triangle  on  —  1  <  x  <  1  with  height  one,  and  hence  area  one.  As  time  increases  the 
profile  steepens  toward  the  right  while  maintaining  its  base  and  height,  and  hence 
area,  even  as  it  transitions  from  being  an  acute  triangle  (single-valued  solution)  to  an 
obtuse  triangle  (multivalued  solution)  (also  see  Fig.  2.4  (left)).  The  consequence  of 
introducing  the  shock  is  to  cut  out  the  portion  of  (2.43)  that  overturns  while  modifying 
the  domains  on  which  the  other  parts  of  the  solution  apply  for  t  >  1 , 


p(x,  t) 


0  v  <  —1, 

•  (1  +  x)/(l  +0  — 1  <  x  <  xs(t), 
0  v  >  xs(t). 


(2.50) 


The  resulting  solution  profiles  are  right  triangles  on  the  base  —1  <  x  <  xs(t) 
and  p  ranging  over  0  <  p  <  maxp  =  p-(xs(t),  t)  =  y/2/(  1  +  t).  To  satisfy  the 
conservation  of  the  integral  of  p,  (2.48)  ensures  that  the  shock  maintains  the  area  of 
A  =  \  (base)(height)  =  ^(^2(1  +  t)  —  1  —  (— 1))(1/V2(1  +  t))  =  1 .  This  illustrates 
why  the  shock  selection  rule  is  sometimes  called  the  equal- area  rule:  Later,  we  will 
see  in  Chap.  5  that  solution  (2.50)  can  also  be  obtained  as  a  similarity  solution. 


2See  Fig.  2.4 — the  placement  of  the  shock  not  only  conserves  the  area  of  the  newly-formed  right 
triangle,  but  also  requires  that  the  areas  of  the  two  cut-off  multi-valued  regions  from  the  obtuse 
triangle  to  be  equal. 


2.7  Further  Directions 
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2.7  Further  Directions 

The  classic  text  on  linear  and  nonlinear  waves  is  the  book  by  Whitham  [106].  There 
are  additional  more  recent  books  on  waves  [65,  80]  and  the  method  of  characteristics 
and  its  extensions  are  covered  in  detail  in  most  books  on  applied  partial  differential 
equations  and  modelling  (see  [45,  81],  for  example). 


2.8  Exercises 


2.1  Let  x  =  ait)  and  v  =  b{t)  be  positions  of  two  points  that  move  according  to  a 
flow  dx/dt  =  v(x,  t),  then  if 


'bit) 


^(0  J a(t) 


f(x,  t)  dx, 


df^\ja 

calculate  lim - -.  Hint:  Let  bit)  =  a(t)  +  shit)  with  s  — >  0. 

b—>a  dt 

2.2  In  one  dimension,  the  Euler  equations  for  compressible  gas  dynamics  are 


dp 

~dt 


+ 


—  (pv)  =  o 
OX 


dp 

dx’ 


where  the  gradient  of  the  pressure  Fix,  t)  represents  an  internal  force  generated  by 
the  gas.  Note  that  the  left-hand  side  of  the  second  equation  can  be  written  in  terms 
of  the  convective  derivative,  pDv/Dt.  The  first  equation  is  the  continuity  equation 
for  the  conservation  of  mass.  Show  that  the  second  equation  is  consistent  with  the 
conservation  of  momentum  with  non-constant  density, 

dipv)  dipv2)  dP 

dt  +  dx  dx 

2.3  Consider  the  solution  of  a  linear  wave  equation  having  the  dispersion  relation 
oo  =  60  (k), 


pix,  t)  =  cos ikx  —  coik)t)  +  cos([k  +  s]x  —  coik  +  s)t). 

(a)  Show  that  for  s  — >  0  this  can  be  re-written  in  terms  of  the  phase  velocity ,  cp , 
and  group  velocity ,  cg ,  as 

pix,  t)  =  2 cos ik[x  —  cpik)t ])  cos  s[x  —  cgik)tf)  +  Ois) 

What  is  the  formula  for  the  group  velocity  for  s  — >  0? 
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(b)  Use  p(x,  t)  =  cos  (be  —  cot)  to  determine  the  dispersion  relation  for  the  equation 

Pt  T  Px  Pxxt  —  0  (2.51) 

and  calculate  the  group  velocity.  Also  determine  the  “modified  dispersion  rela¬ 
tion”  cd(k)  from  p(x,  t)  =  which  will  be  used  in  the  next  exercise. 

2.4  Solitons  (or  “solitary  waves”)  are  steady  profile  travelling  waves  describing  a 
single  “pulse”  whose  size  and  speed  are  connected  through  nonlinear  effects.  Con¬ 
sider  the  nonlinear  wave  equation, 


Pt  T  Px  T  6ppx  Pxxt  —  0? 


called  the  Benjamin-Bona-Mahony  equation. 

(a)  By  looking  for  solutions  of  the  PDE  in  travelling  wave  form,  p(x,  t)  =  P(x  —  ct), 
determine  the  ODE  for  P(s)  with  s  =  x  —  ct. 

(b)  Show  that  there  is  a  one-parameter  family  of  solutions  of  the  form 

P(s)  =  Asech2^) 

and  determine  how  A  and  B  are  related  to  the  speed  c. 

(c)  Show  that  the  nonlinear  solitary  wave  surprisingly  satisfies  the  modified  disper¬ 
sion  relation  from  the  linearised  equation  (2.51). 

2.5  Dispersion  relations  are  not  limited  to  being  algebraic  relations.  Consider  the 
following  fluid  dynamics  model  of  water  waves,  given  in  terms  of  a  potential  function 
0(v,  y,  t)  (defined  on  o  <  y  <  1)  and  a  wave  profile  fix,  t)  on  the  surface  of  the 
water  (at  y  =  0), 


(pxx  +  (pyy  —  0 
(py  =  0 

ft  +/  —  0 

ft  =  fy 


on  0  <  y  <  1 , 
aty  =  0, 
aty  =  1, 
aty  =  1. 


Assume  the  wave  is  f(x,  t)  =  A  cos(kx  —  cot).  Show  that  the  corresponding  potential 
must  be  of  the  form  <p(x,  y,t)  =  B(y)  sin  (be  —  cot)  and  obtain  the  dispersion  relation 
co(k). 


2.6  Obtain  explicit  solutions  p  =  p(x,  t)  for  t  >  0  for  the  following  problems, 
(a)  The  initial  value  problem  for  p(x,  t)  on  —  oo  <  v  <  oo: 


dp  2 1  dp 

- Ye  — =p~Yx~Yt,  p(x,  t  =  0)  =  cosv. 

3t  dx 
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(b)  The  signalling  problem3  for  p(x,  t)  on  v  >  0: 
dp  dp 

- 1-  (x  +  4) —  = —2 p,  p(x  =  0,  t)  =  cos  t,  p(x,  t  =  0)  =  e  . 

dt  dx 

2.7  A  one-dimensional  compressible  fluid  blob  starts  at  t  =  0  with  uniform  density 
p  =  \on\<x<2.  The  blob  obeys  the  conservation  of  mass  equation 


‘dp_  d(pv) 
dt  dx 


with  the  (Eulerian)  velocity  field  given  as  v(x,  t)  =  x2e  3t . 

(a)  Find  the  density  of  the  blob  for  t  >  0  as  a  function  of  position  and  time,  p  = 
p(x,  t). 

(b)  Find  the  positions  of  the  moving  left  and  right  edges  of  the  blob,  x\  (t)  <  x  < 

X2  (0  • 

(c)  Use  your  results  from  parts  (a,  b)  to  directly  evaluate  the  integral 


p(x,  t)  dx 


and  show  that  this  is  consistent  with  the  Reynolds  transport  theorem. 


2.8  (The  method  of  characteristics  in  two  dimensions)  At  time  t  =  0,  a  radioactive 
substance  is  released  into  a  steady  two-dimensional  flow.  The  initial  concentration 
of  the  substance  is  given  by 


co(x,y)  =  * 


1  -  (x2  +  j2) 

0 


X2  +y2  <  1, 

else. 


In  the  absence  of  flow,  the  concentration  would  decay  according  to  the  rate  equation 
dc/dt  =  —c.  The  substance  is  carried  by  the  two-dimensional  Eulerian  velocity  field 
v  =  (1  +x,y). 

(a)  The  conservation  law  for  the  decaying  substance  on  any  fluid  blob  D(t)  is 


D(t) 


) 


cdA  I  =  — 


cdA. 


D(t ) 


Use  the  Reynolds  transport  theorem  to  derive  the  PDE  for  the  concentration  field 

c(x,y,  t). 

(b)  Instead  of  writing  “A  =  (Ao,  yo)”  as  a  material  coordinate  in  rectangular  coordi¬ 
nates,  parametrise  the  initial  data  in  terms  of  polar  material  coordinates  ( R ,  0). 


( A  wave  being  specified  by  a  boundary  condition  from  a  fixed  “signal  source”  position. 
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Write  the  characteristic  ODEs  (< dX/dt ,  dY /dt,  dC/dt)  and  solve  these  equations 
with  the  given  initial  conditions. 

(c)  Obtain  the  explicit  Eulerian  solution  c(x,  y,  t). 

(d)  The  boundary  of  the  region  over  which  the  substance  has  spread  remains  circular 
for  all  times;  find  its  radius  and  the  coordinates  of  its  centre  as  functions  of  time. 

2.9  Consider  the  system  of  wave  equations 

Pt  +  5 px  -  lqx  =  0, 
q,  +  2 px  -  4 qx  -  0. 

(a)  Find  the  eigenvalues  and  eigenvectors  for  travelling  waves  in  this  system. 

(b)  Find  the  solutions  p(x,t)  and  q(x,t)  if  the  initial  conditions  at  t  =  0  are 


p(x,  0)  =  5  sin(7v),  q(x ,  0)  =  — 9cos(4v). 


2.10  The  classic  wave  equation  is 

a2^  _  2a2^ 

~C  dx1' 


(2.52) 


It  can  be  used  to  describe  small  transverse  vibrations  of  a  displaced  string  (like  a 
guitar  or  violin  string)  starting  from  initial  conditions  for  the  position  and  velocity 
of  each  point  along  the  string  at  t  =  0: 


<p(x,  0)  =f(x),  4>t(x,  0)  =  g(x). 


(a)  Show  that  this  problem  can  be  written  as  a  system  of  first  order  wave  equations: 

Pt  -  c2qx  =  0,  qt  —  px  =  0, 
where  q  =  <px  and  p  =  <fit. 

(b)  Solve  the  system  to  obtain  p(x,  t)  and  q(x ,  t)  in  terms  of  travelling  waves. 

(c)  Show  that  from  (b)  we  can  write  the  solution  of  the  original  problem  as 

cj)(x,  t)  =  A(x  —  ct)  +  B(x  +  ct). 

(d)  Determine  the  functions  A  (x)  and  B(x)  in  terms  of  the  initial  data,  functions  fix) 
and  g(x).  This  form  of  the  solution  is  called  the  D  Alembert  solution. 
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2.11  The  shallow  water  equations  describe  fluid  flow  in  shallow  (long,  slender) 
layers,  such  as  rivers.  In  simplest  form,  they  are 


dh  dv  dh 

- b  h - h  v —  =  0, 

dt  dx  dx 


dv 

~dt 


dv 

+  xTx 


dh 


where  v(x,  t)  is  the  fluid  speed,  h(x,t )  is  the  height  of  the  fluid  layer,  and  g  is  the 
acceleration  due  to  gravity. 

Show  that  the  shallow  water  equations  correspond  to  the  following  Lagrangian  state¬ 
ments  about  conservation  of  mass  and  change  of  momentum  for  all  fluid  blobs  moving 
with  the  flow: 


Determine  the  function  G(h) . 

2.12  To  completely  specify  the  problem  for  the  determination  of  the  position  of  a 
shock  in  the  Rankine-Hugoniot  equation  (2.48),  we  must  provide  initial  conditions 
on  when/where  the  shock  first  forms,  xs(t *)  =  v*. 

Consider  the  transport  equation  pt  +  q(p)x  =  0  with  initial  condition  p(x,  0)  = 
fix).  Assume  that  q'ip)  >  0  and  fix)  has  a  local  maximum  (for  example,  fix)  = 
£>-*2). 

(a)  Consider  two  characteristic  curves,  x  =  X(t;  Ao)  and  x  =  X(t;  A\),  and  assume 
that  they  carry  different  values  of  the  solution  (Pq  ^  P\).  Determine  the  position, 
vo,  i ,  and  time,  to,  l ,  where  they  will  intersect. 

(b)  Determine  the  time  when  the  shock  first  forms,  t*  in  terms  of  q,f ,  by  minimising 
the  (positive)  intersection  time  over  all  pairs  of  characteristics.  Hint:  Consider 
the  limit  of  two  characteristics  starting  very  close  together. 

2.13  ( Shocks  in  quasilinear  equations)  Consider  the  inviscid  Burgers  equation, 


dp 

~dt 


starting  from  the  initial  conditions 


p(x,  t  =  0) 


9  —  x2  |*|  <3, 
0  \x\  >  3. 


(a)  Use  the  method  of  characteristics  to  construct  the  parametric  solution. 

(b)  Eliminate  the  parameter  from  the  solution  found  in  part  (a)  to  obtain  a  multi¬ 
valued  solution  in  two  parts. 

(c)  Determine  the  time  /*  and  v* -position  where  x(t)  characteristic  curves  first  inter¬ 
sect  (see  Exercise  2.12). 
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(d)  Use  the  results  from  parts  (b,  c)  along  with  the  shock  speed  equation  (2.48)  to 
write  the  ODE  initial  value  problem  for  the  shock  position  xs(t). 

(e)  Write  the  inviscid  Burgers  equation  as  a  conservation  law,  state  the  conserved 
quantity  (with  the  specific  value  set  by  the  above  initial  condition),  and  integrate 
using  the  solution  from  (b)  to  produce  an  algebraic  equation  involving  t  and  v?  ( t ) . 

2.14  Consider  the  (viscous)  Burgers  equation 


dp 

~dt 


0 


subject  to  boundary  conditions 


p(x  ->  —  oo )  =  2, 


oo)  =  1. 


Determine  the  first  order  ODE  satisfied  by  travelling  wave  solutions,  p(x,  t)  =  P(x  — 
ct).  Solve  the  ODE  and  apply  the  boundary  conditions  to  determine  travelling  wave 
profile  P(s)  and  the  wave  speed  c.  Show  that  this  matches  the  shock  speed  that  would 
be  obtained  with  e  =  0,  for  (2.39). 

2.15  (Fully -nonlinear first  order  PDEs )  The  most  general  first-order  PDE  involving 
only  first  derivatives  of  a  solution  p(x,t )  can  be  written  as 


F(p,px,pt,x,  t)  =  0, 


where  F  is  a  given  function.  Let  s  be  a  parametric  variable  and  parametrise  all 
quantities  in  terms  of  s  as 


x  =  X(s)  t  =  T(s)  p  =  P(s)  px  =  R(s)  pt  =  Q(s) 


(a)  Starting  from  P(s)  =  p(X(s),  T  (s))  take  the  derivative  of  P  with  respect  to  s  and 
use  the  chain  rule  to  express  dP/ds  in  terms  of  R ,  Q ,  X' ,  T' . 

(b)  Take  the  ^-derivative  of  F  =  0  using  the  chain  rule  and  then  use  the  result  from 
part  (a)  to  write  the  equation  in  the  form 


dF 

dR 

dX  1 

dQ 

dT  1 

— 

A 

+  B 

+ 

c 

+  D 

ds 

ds 

ds 

ds 

ds 

=o  =o 


Note  that  selecting  X!  =  A,  R'  =  — B ,  T ’  =  C,  Q!  =  —  D  satisfies  the  overall 
equation. 

Obtain  a  system  of  five  autonomous  ODEs  for  X ,  T,  P,  R,  Q  in  terms  of  those 
variables  and  derivatives  of  F. 
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(c)  Determine  the  characteristic  equations  for  the  general  Hamilton- Jacobi  PDE , 

pt  +  H(p,px,x,  t )  =  0, 


where  H  is  a  given  function. 

(d)  Obtain  the  parametric  solution,  X(A ,  t),  P(A,  t)  for  the  problem  for  p(x ,  t)  on 
—oo  <  v  <  oo  and  t  >  0, 


dp 

Yt 


+ 


0)  =  e  x  . 


Chapter  3 

Variational  Principles 


In  many  problems,  we  need  to  find  an  optimal  solution,  one  that  maximises  a  benefit 
or  minimises  a  cost  for  example.  Various  branches  of  science  include  formulations 
based  on  such  principles: 

•  Fermat’s  principle  of  least  time  in  optics 

•  Hamilton’s  principle  of  least  action  in  mechanics 

•  “paths  of  least  resistance”  in  electrostatics,  hydrology,  and  other  areas. 

The  criteria  selecting  an  “optimal”  solution  in  a  physical  system  may  not  always  be 
clear  (especially  in  biological  problems).  However,  if  we  are  given  a  quantity  to  be 
minimised  or  maximised,  the  calculus  of  variations  provides  a  natural  methodology 
for  reformulating  the  question  in  terms  of  a  differential  equation  problem. 

In  this  chapter  we  focus  on  problems  that  require  the  determination  of  a  function, 
say  y*(v),  as  the  optimal  solution  to  a  given  system.  A  classical  example  known  as 
the  brachistochrone  problem  (meaning  “shortest  time”  in  Greek)  is  motivated  by  the 
question:  What  shape  should  a  ramp  take  in  order  to  deliver  a  mass  (moving  under 
the  influence  of  gravity)  to  a  specified  final  position  in  the  least  time?;  Fig.  3.1  shows 
schematics  of  some  possible  trial  solutions. 

Our  approach  will  extend  elementary  methods  from  calculus  for  finding  local 
optimal  points  to  problems  where  solution  functions,  y*(v),  are  optimal  relative  to 
all  small  possible  variations  of  that  function. 


3.1  Review  and  Generalisation  from  Calculus 

Since  our  presentation  will  build  on  the  basic  formulations  for  finding  maxima  and 
minima  from  single-  and  multi- variable  calculus,  we  briefly  review  that  background 
as  a  means  of  introducing  the  terminology  that  we  will  employ. 

Consider  the  problem  of  finding  local  extrema  of  a  smooth  function  y  =  f(x), 
called  the  objective  function.  If f  has  a  local  maximum  at  v  =  v*  with  value  y*  =/(**) 
then  y*  must  be  greater  or  equal  to  all  values  of/  achieved  in  a  small  neighbourhood 
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Fig.  3.1  Three  trial  solutions  for  the  brachistochrone  problem  for  the  shape  of  a  ramp  to  move  a 
mass  down  from  (0,  1)  to  (1,  0)  under  the  influence  of  gravity  as  quickly  as  possible,  y  =  (1  —  x)P 
for  /3  =  1,2,  1/2 


around  x*, 

/(**)  >/(**  +  e) 

Taking  a  Taylor  series  expansion  of/  at  x 

/(**  +  s)  =  /(x*)  +/r  (•**)£  + 

If  the  slope/7  (x*)  were  nonzero,  then  for  |e|  >  0  either/ (x*  +  s)  or /(x*  —  e)  would 
have  a  value  greater  than/(x*),  violating  our  assumption  that/(x*)  is  a  maximum. 
Consequently,  we  must  have/7(x*)  =  0  at  a  maximum.  An  analogous  argument 
applies  for  local  minima  of  y  =f(x),  and  so  local  extrema  for  smooth  functions  can 
only  arise  at  critical  points ,  where 


for  all  \e\  ->  0.  (3.1) 

:  yields 

i/"(x*)e2  +  •  •  •  as  e  ->  0.  (3.2) 


fix*)  =  0. 

Having  eliminated  the  linear  term  from  the  Taylor  series,  the  higher  order  terms  in 
(3.2)  must  also  respect  the  requirement  (3.1).  This  leads  to  the  standard  condition  on 
the  second  derivative  for  a  local  maximum  if"  (x*)  <  0)  (or  minimum  if"  (x*)  >  0)). 
If/"(x*)  =  0  at  a  critical  point,  then  the  next  nonzero  higher  order  terms  need  to  be 
considered  in  order  to  determine  if  the  critical  point  indeed  yields  a  maximum  or  a 
minimum  of/,  or  is  neither  (an  inflection  point). 

Similarly,  for  a  two- variable  objective  function,  having  a  local  maximum  of  /  at 
(x*,  y*)  with  value  z*  =f(x*,y*)  implies  that  all  nearby  points  satisfy 

>/(x*  +ei,y*  +£ 2 )  for  all  |ei|,  \e2\  ->  0. 

The  corresponding  multi- variable  Taylor  series  for  £\,  £2  — >  0  yields 

/(x*  +  £\,y*  +  £ 2 )  =/(^*,y*)  +  +  dyf(x*,y*)£2  H - • 

" - V - " 

[first  derivative  terms] 
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As  in  the  single  variable  case,  the  linear  terms  must  vanish  at  a  local  maximum  or 
minimum,  and  since  this  must  hold  for  all  choices  of  £\  and  s 2 ,  we  arrive  at  the 
generalisation  of  the  previous  critical  point  condition, 

9*/*  =  0,  dyf. *  =  0.  (3.3) 

Namely,  the  first  partial  derivatives  of  the  objective  function  with  respect  to  all  of 
its  independent  variables  must  vanish,  which  can  be  more  compactly  expressed  in 
terms  of  the  gradient 

Y /*  =  0.  (3.4) 

This  definition  of  critical  points  in  terms  of  the  gradient  applies  to  functions  of  any 
number  of  independent  variables.  Being  a  critical  point  is  a  necessary  condition 
for  a  local  extrema,  but  it  is  not  sufficient.  Conditions  on  the  second  derivatives 
(represented  by  the  Hessian  matrix  for  functions  of  two  or  more  variables)  also 
extend  to  arbitrary  numbers  of  variables  for  determining  whether  a  critical  point 
yields  a  maximum,  minimum  or  inflection  point  of  the  objective  function  [24,  68]. 
Whether  we  are  seeking  a  maximum  or  a  minimum,  all  local  extrema  are  determined 
by  the  critical  point  conditions  and  hence  we  focus  on  obtaining  critical  points  of  the 
more  challenging  class  of  problems  that  we  will  consider. 


3.1.1  Functionals 

In  general,  afunctional  is  a  mathematical  expression  that  can  be  evaluated  to  give 
a  scalar  value  corresponding  to  each  function  to  which  it  is  applied.  Two  simple 
examples  are  provided  by  the  integrals  that  correspond  to  the  area  under  a  curve  and 
the  arclength  of  a  curve, 


y(x)dx ,  S(y)  = 

Jo 

for  any  given  function  y(v)  on  0  <  v  <  1. 

For  the  most  part,  we  will  only  consider  functionals  that  are  definite  integrals  of 
a  function  and  its  derivatives, 


J 1  +  y'(x)2  dx, 


rb 

J(y)=  L(x,y(x),y'(x),y"(x), . .  ,)dx.  (3.5) 

J  a 

Here  the  integrand  function  L(x,y ,  . . .)  is  called  the  Lagrangian  (named  after  Joseph 
Lagrange  (1736-1813)  who  reformulated  classical  mechanics  based  on  a  variational 
principle — see  Sect.  3.3). 

For  the  above  geometric  examples  of  area  and  arclength,  the  dependence  of  the 
functional  on  the  solution  is  straightforward,  while  for  the  brachistochrone  problem 
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from  Fig.  3.1,  the  connection  is  a  little  less  direct.  The  time  of  travel  to  be  minimised 
can  be  expressed  as 


ds  f1  / 1  +  (/)2 

—  =  /  - ax 

V  Jo  v(y) 


(3.6) 


where  we  have  used  the  relationship  between  the  speed  and  distance  travelled, 
v  =  ds/dt ,  and  a  further  relationship  is  also  needed  between  the  speed  and  posi¬ 
tion  (see  Exercise  3.8). 

Now  consider  the  minimisation  of  an  objective  functional,  by  analogy  with  the 
previous  analysis,  we  see  that  for  J(y)  to  have  a  local  minimum1  for  a  particular 
function  y*  (x)  the  functional  must  satisfy 


«%*(*))  <  J(y*(x)  +  eh(x)), 


for  all  s  ->  0  and  for  all  admissible  “perturbation  functions”  h(x)  of  y*(v).  For  the 
moment,  h(x)  is  unspecified  and  serves  to  describe  a  neighbourhood  of  the  optimal 
solution  y*(x)  within  the  space  of  smooth  functions.  While  there  are  many  technical 
issues  underlying  these  concepts  that  are  deserving  of  careful  analysis,  we  defer 
detailed  considerations  to  more  advanced  texts  [71,  99]. 

We  will  describe  a  straightforward  procedure  (as  an  extension  of  the  process  of 
finding  critical  points  of  an  objective  function)  for  determining  optimal  solutions  of 
a  functional.  In  particular,  we  will  have  to  explain  what  it  means  to  take  a  derivative 
of  a  functional  with  respect  to  a  function,  and  we  will  distinguish  these  from  standard 
derivatives  by  using  the  terminology  variational  derivatives. 


3.2  General  Approach  and  Basic  Examples 

Given  an  objective  functional,  there  is  a  systematic  four-step  approach  which  can  be 
used  to  obtain  smooth  solutions  that  correspond  to  local  extrema  of  the  functional: 

(i)  Assume  the  existence  of  an  optimal  solution  y*(v)  and  perform  an  expansion 
of  the  functional  in  the  neighbourhood  of  y*  (x)  using 

y  =  y*(x)  +  eh(x)  J  =  J(y),  (3.7) 

where  the  perturbation  function  h(x)  is  independent  of  s.  Formally  expanding 
J  as  £  — >  0  like  a  Taylor  series  yields 


J  —  J  (y*  +  sh)  —  J  (y*)  + 


dJ 

dy 


9? 


(eh)  +0(s2),  (3.8) 


l;y* 


lrThe  behaviour  at  a  local  maximum  of  J(y)  follows  similarly. 
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where  we  still  have  to  precisely  define  the  notion  of  a  variational  derivative  (in 
this  case  the  evaluation  of  the  derivative  of  an  integral  with  respect  to  a  function). 
The  O(s)  term-  in  the  Taylor  series  of  J  will  be  called  th e  first  variation ,  the 
0(s2)  term  the  second  variation  and  so  on. 

(ii)  Apply  the  critical  point  condition:  the  first  variation  of  the  functional  must  be 
zero  at  the  solution  y*(v)  for  all  admissible  perturbations  h(x), 

dJ 
dy 

We  will  explain  the  meaning  of  ‘admissible’  solutions  and  perturbations  below. 

(iii)  Assuming  the  solution  to  be  smooth,  convert  the  critical  point  condition  on  the 
functional  into  a  differential  equation  for  the  optimal3  solution  y *  (v) . 

(iv)  Solve  the  differential  equation  problem. 

The  details  of  this  approach  will  be  discussed  and  expanded  out  in  following  exam¬ 
ples.  This  framework  applies  to  many  more  complicated  problems.  However,  an 
important  disclaimer  is  that  not  every  problem  can  be  simplified  to  a  differential 
equation  using  this  framework,  those  that  can  be  are  called  variational  problems.  A 
number  of  difficulties  can  arise  to  make  a  problem  non- variational,  such  as  the  func¬ 
tional  not  having  critical  points,  or  it  not  being  possible  to  reduce  the  first  variation 
to  a  differential  equation. 


5? 


{eh)  =  0 


iy» 


(3.9) 


3.2.1  The  Simple  Shortest  Curve  Problem 

We  begin  by  considering  the  simple  problem  of  finding  the  function  y  =  y(x)  that 
gives  the  shortest  path  from  the  origin  (0,  0)  to  the  given  fixed  point  (1 ,  b),  b  e  R,  see 
Fig.  3.2.  While  we  know  the  answer  to  be  the  straight  line  connecting  the  two  points, 
it  will  be  instructive  to  see  how  the  calculus  of  variations  framework  constructs  this 
result. 

The  shortest  path  is  the  one  that  minimises  the  total  arclength  and  hence  our 
objective  functional  is 

J(y)  =  f  J 1  +  (v')2  dx.  (3.10a) 

Jo 


2The  0(sn )  order  symbol  will  be  defined  precisely  in  Chap.  6,  but  for  the  current  context  we  use 
this  to  refer  to  the  s  ->  0  limit  of  the  remainder  of  the  terms  in  the  series  with  coefficients  sN  for 
N  >  n. 

3  Showing  that  a  solution  given  by  a  critical  point  is  a  local  maximum  or  minimum  involves  evaluating 
the  second  variation  at  y*  (jc)  . 
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Fig.  3.2  An  admissible  trial 
solution  y{x)  for  problem 
(3.10a,  3.10b)  and  its 
perturbation  from  the 
optimal  solution  y*  (jc) 


y 


The  statement  of  the  problem  identifies  boundary  conditions  that  all  solutions  must 
satisfy, 

y(0)=0,  y(l)  =  b.  (3.10b) 

Analysis  of  the  problem  begins  by  identifying  the  properties  of  the  family  of  per¬ 
turbations  h(x)  to  the  optimal  solution  y*  (x)  that  define  all  admissible  trial  solutions 
(3.7),  y(x)  =  y*  (x)  +  sh(x).  Applying  boundary  conditions  (3.10b)  to  the  admissible 
solution  y  yields  that 

y(0)  =  y*(0)  +  eh(  0)  =  0,  y(l)  =  y*(l)  +  sh(l)  =  b. 

Since  the  boundary  conditions  on  the  optimal  solution  are  necessarily  the  same  as 
those  on  the  trial  solutions  (y*  (0)  =  0  and  y*  ( 1 )  =  b),  we  obtain  that  the  perturbation 
functions  must  satisfy 

h(0)  =  0,  h(l)=  0.  (3.11) 


Next,  we  substitute  (3.7)  into  (3.10a)  giving 

J  =  [  yj  1  +  (/*  +  e/i')2  dx  =  [  J 1  +  (y^)2  +  2sh%  +  e2(/i/)2  dx. 

Jo  Jo 

For  investigating  the  limit  s  — >  0,  it  is  convenient  to  factor  the  integrand  as 


2  eh'y'* 

i  +  ty'*)2 


s2(h')2  \ 

l  +  (/*)2/ 


dx 
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and  then  using  the  Taylor  series  expansion  ffl  +  z  =  l  +  ^z  +  6>(z2)asz^  0 
applied  to  the  second  factor  allows  us  to  write 

Mi'  + >  (/'  yrfb  *) + 0<£2)-  <312) 

The  first  term  in  this  expansion  gives  the  expression  for  the  functional  at  the  optimal 
solution,  /*  =  /(y*),  but  since  y*  is  unknown,  in  order  to  evaluate  this  integral,  we 
must  examine  further  terms.  The  0(e)  term  provides  the  first  variation  (3.8)  that  will 
determine  the  critical  points. 

It  is  important  to  note  that  the  form  of  the  first  variation  in  (3.8)  was  chosen  to 
intentionally  suggest  that  the  perturbation  function  h(x )  should  appear  linearly  and 
undifferentiated.  We  will  always  seek  to  put  the  first  variation  term  into  this  form. 

For  (3.12),  getting  to  this  form  can  be  accomplished  by  performing  integration 
by  parts  to  shift  the  derivative  off  of  the  h!  factor, 


Vi  +  (V)2 


x=l 


Vl  +  ov2 


x=0 


Vl  +  (V*)2 


hdx\ 


integration  by  parts  will  be  a  fundamental  calculational  tool  in  most  of  our  problems. 
The  boundary  conditions  on  h  given  by  (3.11)  eliminate  the  contributions  from  the 
boundary  terms,  and  leave  the  final  form  of  the  first  variation  of  the  functional  as  a 
single  integral 


8J 

8y 


Vi  +  (V*)2 


hdx, 


(3.13) 


where  we  use  the  8  -notation  to  indicate  that  this  is  a  variational  (functional)  derivative 
(also  sometimes  called  a  Frechet  derivative). 

The  next  step  is  to  enforce  the  critical  point  condition  (3.9)  on  (3.13)  for  all 
admissible  perturbations.  At  this  point,  we  need  a  basic,  but  important  result  from 
analysis  called  the  Fundamental  Lemma  of  the  Calculus  of  Variations  which  states 
that 


If 


g(x)h(x)  dx  =  0  Vh(x)  then  g(v)  =  0on  a  <  x  <  b 


(3.14) 
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In  other  words,  if  the  integral  of  a  product  of  functions  is  zero  for  all  choices  of  one 
of  the  factors,  then  this  is  only  possible  if  the  other  factor  is  identically  zero  on  the 
whole  interval  of  integration.4,5 

Applying  the  fundamental  lemma  to  the  critical  point  condition  for  (3.13), 


d 


y 


* 


o  dx  \  yi  +  (y*)2 


hdx  =  0, 


for  all  possible  h,  we  obtain  the  differential  equation  for  y*(v) 


d 


y 


* 


dx  \\/l  +  (y*)2 


=  0. 


(3.15) 


(3.16) 


We  have  consequently  replaced  an  integral  condition  (the  weak  form,  that  applies 
over  the  whole  domain)  with  an  equivalent  differential  equation  (the  strong  form)  that 
applies  pointwise  on  smooth  solutions.  Together  with  boundary  conditions  (3.10b), 
(3.16)  provides  us  with  a  complete  ODE  boundary  value  problem  defining  the  optimal 
solution  y*(v). 

In  this  problem  it  is  straightforward  to  integrate  the  ODE  once  yielding  a  constant 
of  integration  C\,  then  following  some  algebra, 


yi  +  (/*)2 


0O2  =  =  Q2  >  0,  (3.17) 

1  cl 


for  some  constant  C* .  Then  using  the  boundary  conditions  (3. 10b)  yields  y*  (v)  =  bx, 
confirming  that  the  shortest  path  is  a  straight  line. 

A  second  derivative  test  can  be  applied  to  verify  that  this  solution  is  a  local 
minimum  of  (3.10)  (see  Exercise  3.2),  but  here  we  can  use  an  understanding  of  the 
structure  of  the  problem  to  show  that  it  is  indeed  the  solution  we  seek.'  In  many 
cases,  problem- specific  intuition  can  be  used  to  distinguish  whether  critical  point 
solutions  minimise  or  maximise  the  functional. 

A  more  general  version  of  this  problem,  allowing  for  solutions  as  parametric 
curves  in  the  plane  is  given  in  Exercise  3.3. 


4In  this  simplified  statement  of  this  result,  we  are  assuming  that  g(x)  is  smooth  and  hence  has  no 
discontinuities. 

5The  du  Bois  Reymond  lemma  (2.11)  is  closely  related  to  this  lemma,  choosing  h{x)  =  1  on 
arbitrary  sub -intervals,  and  otherwise  h  =  0. 

6For  the  area  and  arclength  examples,  the  functionals  become  unbounded  for  large  amplitude 
solutions  and  hence  there  are  no  local  maxima. 
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3.2.2  The  Classic  Euler-Lagrange  Problem 

Consider  an  objective  functional  defined  in  terms  of  an  integral  on  the  interval  a  < 
x  <  b  with  an  integrand  that  is  a  function  of  the  independent  variable  x,  the  solution 
y(v)  and  its  first  derivative  y'(x) 


rb 

J 00  =  /  L(x,  y,  y  )  dx, 

J  a 

subject  to  prescribed  boundary  conditions  on  y  given  by 


(3.18a) 


y(a)  =  c,  y(b)  =  d. 


(3.18b) 


Parallelling  the  previous  example,  we  find  that  admissible  perturbation  functions 
must  satisfy  the  homogeneous  boundary  conditions 


h(a)  =  0,  h(b)  =  0. 


(3.19) 


The  expanded  form  of  /(y*  +  sh)  is  given  by 


•b 


J  =  /  L(x,  y*  +  sh,  y^  +  sh')  dx 


a 


and  by  applying  the  multi- variable  Taylor  series,  this  can  be  written  for  s  — >  0  as 


J  = 


a 


( 


,  tdL  dL  , B  , 

L(x,  y*,  y*)  +  s  (  —h  +  I  I  +  0(£  ) 


■) 


dx 


dL 


dL 


—  /*  +  £  1  /  ——hdx-\- 

!a  Ja 


h'  dx  )  +  0(sz). 


In  order  to  address  the  hf  derivative  factor  in  the  third  term,  we  apply  integration  by 
parts  to  obtain 


a 


b  dL  ,  3 L 

— h  dx  =  — h 
dyf  dy ' 


x=b  rb 


x=a  J  a 


d 

dx 


is 


hdx. 


Imposing  the  boundary  conditions  (3.19)  on  the  perturbation  function,  we  can  then 
re-group  the  first  variation  as 


•ft  r 


J  —  J*  H-  s 


a 


dL  d 


_  dy  dx 


m 


hdx  +  0(sz), 


(3.20) 
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This  was  an  important  and  necessary  step  because  in  order  to  apply  the  fundamental 
lemma  (3.14)  to  the  critical  point  condition,  we  must  have  the  first  variation  expressed 
as  a  single  integral  having  the  perturbation  h(x)  as  a  factor, 


b  \dL  d 


a 


_  3  y  dx 


m 


hdx  =  0. 


Since  this  holds  for  all  admissible  choices  of  h(x),  by  (3.14)  we  obtain  the  differential 
equation  on  a  <  x  <  b, 


(3.21) 


Subject  to  the  boundary  conditions  (3.18b),  (3.21)  defines  a  differential  equation 
problem  that  should  have  a  unique  solution  for  y  =  y*(v). 

Equation (3.21)  is  known  as  the  Euler-Lagrange  equation  for  the  functional 
(3.18a).  In  Sect.  3.3,  we  will  see  that  this  particular  form  is  used  extensively  in 
problems  for  mechanical  systems. 

A  very  useful  consequence  of  the  general  form  of  L  in  (3.18a)  is  that  the  Euler- 
Lagrange  differential  equation  for  any  functional  with  L  =  L(x,  y,  y')  is  given  by 
substituting  the  specific  L  into  (3.21).  For  example,  the  ODE  (3.16)  for  the  shortest 
distance  problem  (3.10a)  is  produced  by  (3.21)  with  L  =  y/i  +  ( y' )2.  Similarly,  if 
our  Lagrangian  were  L(x ,  y,  /)  =f(x)y  +  g(x)y'  +  (/)2,  then 


3  L 
3  y 


=  /(*)> 


3  L 

3 y 


=  g(x)  +  2 /, 


=  /(*)  +  2/ 


and  (3.21)  gives  the  Euler-Lagrange  equation  for  y*(v)  as 


f(x)  -  g\x)  -  2 


3.3  The  Variational  Formation  of  Classical  Mechanics 

One  of  the  most  wide-spread  applications  of  the  calculus  of  variations  is  in  deriv¬ 
ing  the  equations  of  motion  for  mechanical  systems  from  the  critical  points  of  a 
functional.  The  theory  is  based  on  a  restatement  of  the  previous  Euler-Lagrange 
results — we  relabel  the  variables  to  describe  the  motion  of  a  mass: 
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y{x) 
y\x) 
L(x,  y,  y') 


t 

y(t ) 
y\t) 

L(t,y,yf ) 


J  —  I  L  dx  —>  I  =  I  Ldt 


:  independent  variable  (time) 
:  solution  (position) 
:  derivative  (velocity) 
:  still  called  the  Lagrangian 

:  functional  (action) 


The  Euler-Lagrange  equation  (3.21)  now  takes  the  form 

dL  d  /dL\  _ 
dy  dt  \9// 


(3.22) 


and  the  key  step  in  the  application  to  mechanics  is  in  identifying  the  Lagrangian 
function  L  as  the  difference  between  the  kinetic  energy  T  and  potential  energy  V 


L  =  T  -V. 


(3.23) 


William  Hamilton  (1805-1865)  used  this  form  of  Lagrangian  based  on  the  assump¬ 
tion  that  mechanical  systems  should  satisfy  what  come  to  be  called  Hamilton  ’s  prin¬ 
ciple  of  least  action ,  namely  that  the  optimal  (actual,  observable)  solution  y*(t)  will 
be  the  one  that  minimises  the  action  integral,  I. 

A  very  simple  example  of  a  mechanical  system  is  the  vertical  motion  of  a  rigid 
object  with  mass  m  subject  to  gravity  g.  The  kinetic  energy  of  the  point  mass  is 
T  =  \  m(y')2,  while  the  potential  energy  due  to  gravity  is  given  by  V(y)  =  mgy. 
Substituting  this  form  for  L(t ,  y,  y')  (3.23)  into  the  Euler-Lagrange  equation  (3.22) 
yields 

dV  d(y')  d2y  dV  d2y 

- m — 1 —  =0  =>  m — yr  = - =>  m — ^  =  —mg, 

dy  dt  dt 2  dy  dt 2 

exactly  as  would  be  obtained  from  Newton’s  second  law,  but  without  the  need  to 
explicitly  work  out  the  forces  acting  on  the  system.  The  difference  between  the 
Lagrangian  and  force-based  approaches  is  not  significant  in  this  example,  but  for 
more  complicated  systems,  the  Euler-Lagrange  approach  can  notably  simplify  the 
process  of  deriving  the  equations  of  motion. 

Thorough  treatments  of  the  implications  of  this  formulation  are  given  in  books 
on  intermediate  and  advanced  mechanics  [40,  62,  67];  we  limit  ourselves  here  to  the 
basic  concepts  that  are  most  immediately  useful  for  the  formulation  of  models  for 
mechanical  systems. 
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3.3.1  Motion  with  Multiple  Degrees  of  Freedom 


One  generalisation  of  the  Euler-Lagrange  equation  of  motion  is  the  derivation  of  the 
equations  of  motion  for  a  multi- variable  systems.  As  a  specific  example,  consider 
the  motion  of  a  projectile  in  two  dimensions  with  position  x  =  (. x(t ),  y(t))  subject  to 
gravity  acting  in  the  y-direction.  The  action  can  be  written  in  terms  of  the  difference 
of  the  kinetic  and  potential  energies, 


dt. 


The  principle  of  least  action  then  requires  that  I(x,y )  is  minimised  over  all  possible 
choices  of  the  unknown  functions  x(t )  and  y(t).  We  assume  that  an  optimal  solution 
(x*(0>  y*(0)  exists  and  express  all  admissible  solutions  in  terms  of  independent 
perturbations  of  the  unknowns 


x(t)  =  x*0)  +  eh\(t),  y(t)  =  y*0)  +  sh2(t). 


Substituting  into  the  action,  I  (x,  y)  and  Taylor  expanding  in  the  limit  s  ->  0, 


I  —  /*  +  £ 


\m(x'ji\  +  y')„h,2)  -  mgh2 \  dt  +  0(s2). 


By  application  of  integration  by  parts  and  eliminating  the  boundary  terms,  we  obtain 
the  O(s)  term  as 


J  (mx’f)h\  +  (my"  +  mg)h2  dt. 


Satisfying  the  critical  point  condition  implies  that  this  integral  must  be  equal  to  zero 
for  all  choices  of  h\(t)  and  One  class  of  possibilities  is  given  by  =  0  and 
h\ (t)  arbitrary,  this  yields  the  equation  rax"  =  0.  Another  choice  ish\  =0  with  Ii2(t) 
arbitrary,  yielding  mg  +  my"  =  0;  the  overall  result  comes  from  the  intersection  of 
these  cases  using  the  linear  independence  of  the  individual  perturbations. 

This  can  be  understood  as  a  generalisation  of  fundamental  lemma  (3.14)  for  the 
dot  product  of  vector  functions,  g(v),  h(x)  e  M;\ 

If  f  g(v)  •  h(x)  dx  =  0  Vh(v)  then  g(v)  =  0  on  a  <  v  <  b.  (3.24) 
J  a 

In  other  words,  if  the  value  of  the  integral  is  zero  for  all  choices  of  component 
perturbation  functions  in  h  then  each  component  of  g  must  be  identically  zero, 
gi(x)  =  0  for  i  =  1 ,  2,  . . . ,  n. 
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Hence  we  can  arrive  at  the  (general)  result  that  the  critical  point  condition  will  yield 
separate  Euler-Lagrange  equations  with  respect  to  each  variable  with  an  independent 
perturbation.  In  this  case 


(3.25a) 


(3.25b) 


Thus  systems  of  differential  equations  for  individual  particles  moving  in  two  or  three 
dimensions  (x  =  (x,y,z)),  or  sets  of  particles  (xi,  X2,  . . .),  can  be  obtained  from 
the  Lagrangian  for  the  complete  system,  L(t,x i,  X2,  . . . ,  x'1?  xf2,  . .  .)>  through  the 
application  of  the  Euler-Lagrange  equation  to  each  independent  unknown  function. 


3.4  The  Influence  of  Boundary  Conditions 

In  the  previous  examples,  we  have  seen  the  important  roles  played  by  the  boundary 
conditions  in: 

(i)  specifying  boundary  conditions  for  the  Euler-Lagrange  problem 

(ii)  determining  boundary  conditions  on  the  perturbation  functions 

(iii)  eliminating  the  boundary  terms  generated  by  integration  by  parts  in  the  calcu¬ 
lation  of  the  first  variation 

The  last  point  is  perhaps  the  most  important  in  terms  of  reducing  the  problem  to 
a  differential  equation;  if  it  is  not  possible  to  eliminate  the  boundary  terms,  then 
the  fundamental  lemma  cannot  be  applied.  Choices  of  boundary  conditions  that  are 
compatible  with  this  requirement  are  called  natural  boundary  conditions. 

So  far,  we  have  seen  how  to  work  with  Dirichlet  boundary  conditions  (e.g. 
y(a)  =  c).  We  will  now  consider  how  other  types  of  boundary  conditions  affect 
problems. 


3.4.1  Problems  with  a  Free  Boundary 

Consider  a  modification  of  the  problem  of  determining  the  shortest  path  we  consid¬ 
ered  in  Sect.  3.2.1:  find  the  function  y(x)  passing  through  the  origin  that  gives  the 
shortest  path  to  the  vertical  line  x  =  1 . 

Being  a  problem  in  terms  of  the  minimum  distance,  the  functional  to  be  minimised 
is  the  same  as  in  (3.10a), 


7 Given  the  lack  of  elementary  approaches  if  the  fundamental  lemma  can  not  be  used,  it  is  tempting 
to  call  them  necessary  boundary  conditions. 
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J(y)  =  [  J l  +  (/)2  dx, 

Jo 

but  the  problem  statement  now  provides  only  one  definite  boundary  condition, 
y(0)  =  0.  There  is  no  specific  requirement  on  y  at  v  =  1  and  hence  this  is  called  a 
free  boundary.  In  order  to  see  how  to  deal  with  such  a  situation,  we  revisit  (3.7)  to 
consider  the  form  of  admissible  solutions, 


y(x)  =  y*(x)  +  eh(x). 


At  v  =  0,  we  should  have  zero  perturbation,  h( 0)  =  0  since  y(0)  =  y*(0)  =  0.  At 
x  =  1,  y(l)  and  y*(l)  are  arbitrary,  and  so  we  have  no  information  about  h(  1). 
Proceeding  as  before,  we  expand  /(y*  +  eh)  for  e  — >  0  as 


J  —  J*  £ 


y'h 


yi  +  oo2 


x=l 


x=0 


J 


* 


A  +  (/*)2 


+  0(s2). 


We  need  the  total  contribution  from  the  boundary  terms  to  vanish,  requiring 


yrrw 


where  the  x  =  0  boundary  term  vanishes  due  to  h(0)  =  0.  Since  we  do  not  know 
anything  about  h(  1),  in  order  to  guarantee  that  the  remaining  term  vanishes,  we 
require  that  the  optimal  solution  satisfies  the  natural  boundary  condition, y*(l)  =  0. 

It  is  important  to  note  that  irrespective  of  the  form  taken  by  the  natural  boundary 
conditions,  they  are  only  involved  in  eliminating  the  boundary  terms  in  order  to  leave 
the  integral  in  a  form  compatible  with  the  fundamental  lemma.  Consequently,  the 
Euler-Lagrange  equation  is  again  given  by  (3.21)  and  (in  this  case)  leads  to  the  same 
ODE  as  found  for  the  prescribed  end-point  case  (3.16).  However,  the  new  boundary 
conditions  for  the  Euler-Lagrange  problem,  y*(0)  =  0  and  y  *  ( 1 )  =  0,  now  select 
a  different  solution,  y*  (v)  =  0  (corresponding  to  the  line  from  the  origin  along  the 
v-axis  up  to  the  point  of  intersection  with  the  line  v  =  1). 


3.4.2  Problems  with  a  Variable  Endpoint 

Another  version  of  the  “find  the  shortest  path”  problem  is  to  find  y(x)  that  gives  the 
shortest  path  from  the  origin  to  a  given  curve  y  =f(x)  (see  Fig.  3.3). 

The  boundary  condition  y*  (0)  =  0  holds  as  before,  but  now  the  other  end-point 
can  be  any  point  on  the  curve  y  =f(x).  Let  us  denote  this  unknown  point  as  x  —  b 
and  then  the  boundary  condition  is  simply 


y(b)  =f(b). 


(3.26a) 
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Fig.  3.3  A  trial  solution,  y 

y(x)  on  0  <  x  <b,  for  the  j 

“shortest  path  to  a  given 
curve ”  variable  endpoint 
problem  (3.26) 


The  objective  functional  is  (as  before)  the  total  arclength 

rb 

J(y )  = 

Jo 

We  begin  the  analysis  as  usual,  by  adding  a  perturbation  to  each  unknown ,  y(v)  and 
b.  As  before,  the  admissible  solutions  take  the  form 

y(x)  =  y*(x)  +  sh(x), 

while  the  admissible  endpoints  can  be  written  as 

b  =  b*  +  sc, 


y  l  +  Cy')2  dx. 


(3.26b) 


where  c  is  a  perturbation  constant.  Consequently,  we  have 


'b*+ec 


1  +  Cy*  +  shr)2  dx. 


(3.27) 


At  this  point  in  previous  examples,  we  have  Taylor  expanded  J  in  the  limit  s  —>  0.  A 
new  complication  present  in  this  problem  is  that  a  perturbation  appears  in  the  limits 
of  integration  as  well  as  in  the  integrand.  One  effective  tool  for  dealing  with  this 
issue  is  Leibniz’s  rule  for  the  derivative  of  an  integral 


d 

dz 


b{z) 


g(x,y(x),z)dx 


a(z) 
b 


a 


dg 

dz 


dx  + 


db  da 

g(b,  y(b),  z)—  -  g(a,y (a),  z)  — 
dz  dz 


(3.28) 
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Applying  Leibniz’s  rule  to  (3.27)  with  s  playing  the  role  of  z,  we  obtain  the  first 
variation  in  the  form 


dJ 

ds 


y'*ti 


s=o  7o  yr+  (y^)2 


dx  +  cJ  1  +  /*(Z?*)2 


After  using  integration  by  parts  and  imposing  the  critical  point  condition,  this  yields 


(Vi  +y'Jb«)2  + 


y*(b*)h(b*)  \  _  rb*  d_  {  \ 

vi  +y;(^)2/  Vo  ^vyr+w/ 


hdx  =  0. 


If  the  boundary  terms  vanish,  then  applying  the  fundamental  lemma  to  the  integral 
will  yield  the  expected  Euler-Lagrange  equation,  but  it  is  not  yet  clear  which  bound¬ 
ary  conditions  on  y  achieve  this  since  c  and  h(b *)  are  unknown.  However,  we  have  not 
yet  fully  utilised  (3.26a),  namely  that  y(b)  =  f(b).  Writing  this  out  more  explicitly, 
we  have 

y*(b*  +  sc)  +  sh(b *  +  sc)  =  f(b*  H-  sc).  (3.29) 


Expanding  (3.29)  in  the  limit  s  — >  0,  matching  first  terms  in  the  respective  expansions 
of  the  left-  and  right-hand  sides,  we  recover  y*(b*)  =f(b *)  for  the  optimal  solution. 
Matching  O(s)  terms,  we  find  that 

+  h(b*)  =f(b*)c 


which  can  be  solved  for  c 


h(b,) 

/'(**) -y*(**)' 


Substituting  this  result  into  the  boundary  terms  yields 


y*(b*) 

yfi  +  y*(b*)2 


+ 


a/1  +  ^i(^*)2  \ 


h(b*). 


Since  no  constraints  are  known  on  the  value  of  h(b *),  in  order  to  ensure  that  this  term 
vanishes  for  all  admissible  perturbations,  we  require  that  the  terms  in  parentheses 
sum  to  zero,  which  (after  a  little  algebra)  yields  the  natural  boundary  condition 

y'(b*)  =  ~/W  (3J0) 

Recalling  that  the  Euler-Lagrange  equation  yields  straight  lines  as  solutions  (3.17), 
this  condition  selects  the  minimum  distance  to  the  curve  via  the  line  that  inter¬ 
sects  the  curve  perpendicularly  (recall  the  negative  reciprocal  slope  relation  from 
analytic  geometry).  While  it  may  seem  unusual  to  have  three  boundary  conditions 
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(on  y  (0) ,  y  (b) ,  /  ( b ))  for  a  second  order  differential  equation,  it  is  not  overdetermined 
since  b*  is  unknown  and  has  to  be  determined  as  part  of  the  solution. 


3.5  Optimisation  with  Constraints 

While  the  solutions  of  many  systems  are  determined  from  optimising  some  prop¬ 
erty  of  the  system,  in  some  cases,  the  choice  of  the  optimal  solution  is  constrained 
further  by  some  implicit  structure  in  the  problem — such  as  a  finite  amount  of  build¬ 
ing  materials  limiting  the  maximum  size  of  a  structure,  or  finite  time  limiting  how 
thoroughly  a  task  can  be  performed.  These  are  examples  of  constrained  optimisation 
problems — the  specific  constraints  are  crucial  in  selecting  the  relevant  solution.  In 
simple  problems,  the  constraints  can  be  substituted  directly  into  the  objective  func¬ 
tion  in  order  to  obtain  a  reduced  problem  describing  all  achievable  solutions  (see 
Exercise  3.5,  for  example),  but  for  problems  where  this  cannot  be  done,  the  method 
of  Lagrange  multipliers  can  be  employed  instead.  This  approach  will  be  used  to  inves¬ 
tigate  several  classes  of  calculus  of  variations  problems  in  the  remaining  sections  of 
this  chapter, 

(i)  Isoperimetric  problems:  optimising  a  functional  subject  to  a  condition  on  an 
integral  of  the  solution. 

(ii)  Holonomic  systems:  optimising  a  functional  subject  to  a  geometric  condition 
applied  pointwise  on  the  solution. 

(iii)  Optimal  control:  optimising  a  functional  subject  to  a  differential  equation 
applied  pointwise  on  the  solution. 

We  begin  with  a  brief  review  of  the  method  of  Lagrange  multipliers  applied  to 
problems  from  multivariable  calculus. 


3.5.1  Review  of  Lagrange  Multipliers 


In  constrained  optimisation  problems,  feasible  solutions  lie  within  a  subset  of  the 
space  of  all  possible  solutions.  The  method  of  Lagrange  multipliers  identifies  the 
feasible  solutions  through  the  same  process  used  in  unconstrained  optimisation 
problems — namely  obtaining  the  possible  solutions  from  solving  equations  deter¬ 
mined  by  the  critical  point  condition  (cf.  (3.4)) 


3  (objective) 


3  (variables) 
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The  Lagrange  multiplier  approach  constructs  an  “augmented  objective  function”  ££ 
incorporating  the  original  objective  function,  call  it .f(x,  y),  along  with  all  constraints 
so  that  the  critical  points  are  still  described  by  a  gradient  condition 

VJzf  =  0  all  critical  point  solutions.  (3.31) 

Consider  the  fundamental  problem  of  maximising  a  function  of  two  variables  subject 
to  a  single  constraint: 

Find  (jc*,  ;y*)  yielding  ma x/(jc,  y)  from  among  the  points  (jc,  y)  on  the  curve  g(x,  y )  =  0, 

where  g(x,  y)  =  0  is  the  implicit  equation  of  a  given  curve. 

Suppose  that  the  parametric  equations  describing  the  g  =  0  curve  are  known, 
x  =  x(t),y  =  y(t),  such  that 


g(x(t),y(t))  =  0  Vf. 

Differentiating  this  equation  using  the  chain  rule  yields 

0=jt[g(x(t),y(t))]  =  Vg-^=0.  (3.32) 

If  the  parameter  t  is  taken  to  represent  time,  the  geometric  interpretation  of  this 
equation  is  that  the  “velocity”  vector  dx/dt  (which  is  tangent  to  the  curve  x(f))  is 
perpendicular  to  the  level-curve  g  =  0  since  the  gradient  Vg  is  orthogonal  to  contours 
of  constant  function- value. 

Substituting  x(t),  y(t )  into  f(x,  y)  yields  a  function  of  a  single  variable 


F(t)  =f(x(t),y(t)) 


that  is  parametrised  along  the  curve  g  =  0.  Finding  the  critical  points  of  F(t)  with 
respect  to  t  gives  the  critical  points  of  /  on  g. 


dF 

dt 


dF  dx 


(3.33) 


Hence,  the  chain  rule  shows  that  at  a  critical  point,  V/  is  also  orthogonal  to  the  velocity 
vector.  In  two  dimensions,  this  forces  the  two  gradients  to  be  a  scalar  multiple  of 
each  other  and  then  we  obtain 


v/  =  AVg 


where  X  (called  the  Lagrange  multiplier)  is  another  unknown  that  needs  to  be  deter¬ 
mined.  In  component  form,  the  resulting  set  of  equations  of  three  variables  in  the 
three  unknowns  is 


fx(x*,y *)  =  Xgx(x*,y*),  fy(x*,y*)  =  Agv(v*,;y*),  g(v*,y*)  =  0. 
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A  typical  solution  strategy  for  solving  the  equations  is  to  first  solve  the  V/  =  XV  g 
equations  to  obtain  x(X),y(X),  and  then  substitute  these  into  g(v(A),  y(A))  =  0  to 
obtain  a  final  equation  for  the  values  of  X. 

This  approach  generalises  in  a  straightforward  manner  to  higher  dimensions  and 
to  multiple  constraints  («5f  =  /  —  Xg  —  jih  —  •  •  • )  [24,  68].  We  note  that  had  the 
parametric  equations  for  x(t),y(t)  been  available,  then  solutions  could  have  been 
obtained  directly  from  (3.33).  The  Lagrange  approach  does  not  actually  use  these 
parametric  equations,  just  the  assumption  of  their  existence,  to  reformulate  the  prob¬ 
lem  into  a  convenient  form. 

The  above  discussion  motivates  the  introduction  of  the  augmented  (constrained) 
Lagrange  objective  function  satisfying  (3.31), 

^f(x,  y,  X)  =f(x,  y)  -  Xg(x,  y).  (3.34) 

This  is  just  a  convenient  form  that  reproduces  the  set  of  equations  above  as  the  critical 
point  equations  for  ££  with  respect  to  its  three  variables, 

VJ£?  =  0  **  { =  0,  =  0,  J&=0}.  (3.35) 


3.6  Integral  Constraints:  Isoperimetric  Problems 

Consider  problems  where  the  solution  y*  (x)  satisfies 

max  (  J=  L(x,  y,  y)  dx  )  subject  to  G  =  /  g(x,  y,  y)  dx  =  0. 
y  \  Ja  )  Ja 

(3.36) 

These  are  sometimes  called  isoperimetric  problems  with  the  name  referring  back  to 
a  classic  geometric  problem  of  maximising  the  area  enclosed  by  a  curve  constrained 
to  have  a  fixed  perimeter  (see  Exercise  3.20). 

Maximising  J  subject  to  the  constraint  G  =  0  can  be  expressed  in  terms  of  an 
augmented  Lagrange  functional  of  the  form 


I(y,  X)=j-  XG ,  (3.37) 

where  A  is  a  constant.  This  can  also  be  restated  in  terms  of  an  augmented  Lagrangian 
function 

rb 

/=  /  dx  where  £T(x,  y,y,  X)  =  L(x,y,  /)  —  Xg(x,y,  /).  (3.38) 
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Once  this  functional  has  been  identified,  we  follow  the  standard  variational  process 
described  in  Sect.  3.2,  starting  with  the  introduction  of  perturbations  to  all  of  the 
unknowns 

y(x)  =  y*(x)  +  sh(x),  X  =  A*  +  sy. 


Expanding  I  as  a  Taylor  series  for  s  —>  0  and  enforcing  the  critical  point  condition 
on  the  0(e)  term  yields 


>b  r 


a 


dL  d 


3  y  dx 


(£) 


-X 


* 


dg  _  d_ 
3  y  dx 


(0) 


hdx  +  y  g(x,y*,y')dx  =  0 


a 


Requiring  this  to  hold  with  respect  to  (i)  all  possible  y  perturbations  recovers  the 
geometric  constraint 


g(x,y*,y*)dx  =  0, 


(3.39a) 


and  (ii)  all  h(x)  perturbations  using  the  fundamental  lemma  yields  the  Euler- 
Lagrange  equation 


dJf 
9  y 


(3.39b) 


This  again  shows  that  the  form  of  the  Euler-Lagrange  equation  (3.21)  generalises  to 
a  wide  array  of  problems. 

As  an  example,  consider  the  problem  of  finding  the  function  y(x)  >  0  on  0  < 
x  <  \  satisfying  boundary  conditions  y(0)  =  0  andy(l)  =  0,  with  a  given  arclength, 
S  =  fQ  yj\  +  (y')2  dx  that  maximises  the  area  under  the  curve,  A  =  fQ  y  dx. 

We  can  write  the  augmented  Lagrangian  Jzf  =  A  —  XP  to  maximise  the  enclosed 
area  subject  to  the  constraint  of  a  fixed  perimeter,  P  =  S  +  1  (which  includes  the 
contribution  from  the  lower  boundary,  y  =  0).  Applying  (3.39a,  3.39b),  we  obtain 


i  +  x,i(_jL= 

*  Vyi  +  «> 


dx  =  0. 


Solving  the  ODE  and  imposing  the  boundary  conditions,  yields 


y*(x)  = 


(3.40) 
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Fig.  3.4  The  solution  y*  (jc) 
(3.40)  maximising  the  area 
under  the  curve  on  0  <  v  <  1 
for  given  arclength  S 


which  can  be  integrated  to  yield  the  resulting  area  and  arclength  for  a  given  value  of 
the  Lagrange  multiplier  A.*, 


a(X*) 


y*  (v)  dx  =  X*  arcsin 


l 

4  ’ 


s(X*) 


-y/l  +  y^(x)2  dx  =  2A*  arcsin 


To  complete  the  solution  of  the  constrained  problem,  the  arclength  constraint,  s(X*)  = 
S,  must  be  applied  to  determine  a  value  for  A*  in  terms  of  S. 

From  (3.40)  we  see  that  acceptable  (real-valued)  solutions  must  have  A*  >  i, 
which  limits  S  in  the  problem  statement  to  be  in  the  range  1  <  S  <  7t/  2.  Noting 
that  (3.40)  describes  arcs  of  circles  going  through  the  fixed  endpoints,  with  centres 

given  by  ,  —■^jx'l  —  ^),  with  A*  being  the  radius  of  the  circle,  see  Fig.  3.4.  The  fact 

that  (3.40)  is  not  valid  for  S  >  7t/2  suggests  that  the  assumed  form  of  the  solutions 
(in  this  case,  graphs  of  functions  y  =  y(x))  may  be  too  restrictive  and  different 
representations  of  the  problem  could  be  helpful,  see  Exercise  3.20. 


3.7  Geometric  Constraints:  Holonomic  Problems 

Consider  the  problem  of  finding  parametric  equations  for  a  curve  (v*  (t) ,  y *  (t )  on 
0  <  t  <  T)  that  minimises  the  integral 

1=  /  L(t,x,  y,  x  ,  y)  dt, 

Jo 


(3.41a) 
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subject  to  the  constraint  that  for  each  value  of  t , 

g(x(t),y(t),t)  =  0.  (3.41b) 

Problems  of  this  form  are  called  holonomic  problems  in  the  context  of  dynamics  of 
mechanical  systems.  The  constraint  (3.41b)  imposes  a  condition  that  the  motion  in 
the  system  must  satisfy  at  each  instant  of  time,  as  typically  imposed  by  structural 
geometric  constraints  (as  in  the  case  of  a  roller  coaster  car  moving  on  its  track). 

It  can  be  shown  that  the  appropriate  generalisation  of  (3.34)  for  holonomic  prob¬ 
lems  is  the  augmented  Lagrangian 


Jz?(x,  y,  X)  =  L(t,x,  y,x',y')  -  X(t)g(x,y,  t)  ==>■  =  /  Jzf  dt.  (3.42) 

Jo 

Here,  a  notable  difference  from  (3.38)  for  isoperimetric  problems  is  that  the  Lagrange 
multiplier  is  a  function  of  t ,  rather  than  a  constant,  and  cannot  be  factored  out  of  the 
integral  as  we  did  in  (3.37).  Applying  independent  perturbations  to  each  unknown, 

x(t)  =  x*(t)  +  sh\(t),  y(t )  =  y*(0  +  sh2(t),  X(t)  =  A ,*(f)  +  ey(t), 


and  expanding  J>  as  s  — >  0,  we  obtain 


-gy 


(3.43) 


where  the  terms  in  the  first  line  give  the  contributions  from  the  original  objective 
function  and  the  terms  in  the  second  line  represent  the  influence  of  the  constraint. 
Noting  that  all  perturbations  are  independently  allowable,  in  enforcing  the  critical 
point  condition,  S  =  0  (through  linear  independence,  as  in  (3.24))  we  conclude 
that  the  coefficient  of  each  perturbation  must  vanish 


V/q  : 

vEuler-Lagrange : 

3 2z?  d  | 
dx  dt  * 

Vft2  : 

yEuler-Lagrange : 

d  | 

3  y  dt  ' 

WY -  * 

Vy  : 

Geometric  constraint: 

g(x*(t),y*(t),  t)  =  0, 

where  we  have  combined  the  L,  g  terms  from  (3.43)  to  write  the  Euler-Lagrange 
equations  in  a  more  compact  form  using  (3.42).  In  fact,  this  system  can  be  condensed 
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further  to  be  expressed  as 


u  Euler-Lagrange: 


d  Sd3f\ 

- - 1:Itt)=0  u  =  {x,y,X], 

ou  at  \  au  ) 


where  u  denotes  each  unknown  function  of  t  in  the  system.  In  other  contexts,  systems 
containing  both  geometric  constraints  and  differential  equations  are  often  also  called 
differential  algebraic  equations  (DAE). 


3.8  Differential  Equation  Constraints:  Optimal  Control 

For  holonomic  problems,  we  considered  optimising  a  functional  subject  to  constraints 
on  the  possible  solution  applied  pointwise  over  the  entire  domain.  We  now  extend 
the  analysis  to  systems  where  the  constraint  is  given  by  a  differential  equation.  In 
such  cases,  we  want  to  minimise 


J=  L(t,  x(t),  u(t))  dt, 

Jo 

subject  to  the  constraint  that  at  each  t  on  0  <  t  <  T, 


(3.44a) 


dx 

dt 


=  f(t,x(t),  u(t)), 


(3.44b) 


along  with  given  conditions  on  x(t) 


v(0)  =  vo  x(T)  =  x\ . 


(3.44c) 


Problems  of  this  form  arise  in  optimal  control  theory  which  seeks  to  determine  a 
control  function  u(t)  that  allows  the  state  function  x{t)  to  achieve  a  desired  target 
value  (or  “final  state”)  x\  while  also  minimising  the  “cost”  of  the  overall  process  in 
imposing  the  control  on  the  evolution  of  the  state.  The  cost  will  be  represented  by  the 
integral  (3.44a).  The  ending  time  T  will  also  be  considered  as  a  degree  of  freedom 
and  used  to  help  reduce  the  cost  and  make  the  target  achievable,  as  some  systems 
might  be  able  to  run  cheaply,  if  perhaps  slowly. 

Equation (3.44b)  is  called  a  state  equation ;  it  describes  how  the  system  evolves 
based  on  a  dynamical  model  relevant  to  the  problem  (e.g.  mechanics  or  chemical 
kinetics).  In  the  absence  of  external  forcing,  described  here  by  the  control  function 
u(t ),  ( u  =  0),  then  the  natural  ( uncontrolled )  dynamics  starting  from  the  initial 
condition  (3.44c)i  are  given  by  the  solution  of 


dx 


=  /(v,  0,  t)  v(0)  =  v0 


dt 


(3.45) 
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Under  typical  assumptions  on  the  rate  function  /,  this  initial  value  problem  has  a 
well-defined  solution  x(t).  However,  this  solution  might  not  equal  the  target  value 
x\  at  any  value  of  T  >  0,  or  might  reach  it  only  after  an  undesirably  long  time.  Both 
of  these  situations  would  make  the  uncontrolled  solution  x  unacceptable,  and  hence 
we  need  to  consider  modifying  the  evolution  of  the  state  from  (3.45)  by  forcing  the 
system  with  some  control  function  u(t)  yielding  (3.44b). 

Appropriate  control  functions  may  allow  the  state  to  reach  the  target,  but  may  also 
involve  excessive  input  of  energy,  or  could  still  take  too  long.  Hence,  the  “optimality” 
in  optimal  control  theory  refers  to  minimising  the  cost  involved  in  imposing  the 
control.  Two  basic  types  of  cost  functionals  that  can  be  used,  depending  on  whether 
system  speed  or  minimisation  of  the  energy  of  the  control  function  is  the  priority, 
are  given  respectively  by 

J speed  =  /  1  dt  —  T,  ^energy  ~  I  U  dt.  (3.46) 
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Examples  of  systems  that  can  be  described  in  this  form  include  the  input  control  of 
chemicals  in  reaction  systems  to  maintain  a  steady  output,  the  design  of  controlled- 
release  timed  drug  delivery,  and  car  power-steering  systems.  In  addition  to  uses  in 
“engineered”  systems,  such  models  can  also  be  applied  to  describe  how  biological 
systems  adapt  to  their  environments. 

System  (3.44)  is  a  classic  optimal  control  theory  problem — its  solution  involves 
determining  the  evolution  of  the  state  x(t),  the  control  u(t ),  and  the  optimal  stop¬ 
ping  time  T.  The  imposition  of  the  state  equation  at  each  time  suggests  forming  an 
augmented  Lagrangian  with  a  time-dependent  Lagrange  multiplier,  as  in  (3.42), 


££ (x,  x! ,  u,  X)  =  L(t,  x ,  u) 


-fix,  u,  t) 


) 


(3.47) 


and  corresponding  augmented  functional 


1=  f  J2f(v,  x,  u,  X)  dt.  (3.48) 

Jo 


Since  the  stopping  time  is  also  an  unknown,  we  will  also  draw  on  the  analysis  carried 
out  for  the  variable  endpoint  problem  in  Sect.  3.4.2. 

Applying  perturbations  to  all  of  the  unknowns 

x  =  x*  +  sh{t)  u  =  u*  +  svf)  X  =  X*  sy it)  T  =  T*  +  sS, 

the  cost  functional  is  then 


'T*-\-eS 


(v*  H-  s  h ,  x *  sh  ,  u*  -\-  sy)  dt . 


(3.49) 
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Expanding  I  as  a  Taylor  series  for  s  — >  0  gives 

dl 


I  —  I±  +  £ 


ds 


+tf(ez), 


£'=() 


where,  using  Leibniz’  rule  (3.28),  we  obtain  the  first  variation  in  the  form 

■T*  rdjf  d^f ,  djf  djf  i 

- h  T - h  + - v  T- - y  dt  T  J2*  S . 

I  dx  dx'  du  dX  y  \  t=n 

Applying  integration  by  parts  to  the  h '  term,  the  first  variation  becomes 

■r,  r 


'  dse 

d  , 

I 

- 

dx 

~~  ~dt  ' 

UW1 

d%  3«5f  ' 

h  +  — — v  +  ~^—y 


du 


dX 


dl£ 

dt  +  (  J£S  + - h 

dxr 


(■ 


) 


I  t=T* 

(3.50) 


where  we  have  used  the  initial  condition  (3.44c)i  to  obtain  that  the  perturbation 
satisfies  h( 0)  =  0.  Recalling  the  form  of  the  condition  at  the  variable  endpoint 
(3.26a),  we  write  the  target/final  value  condition,  x(T)  =  x\,  as 

x*(T*  +  sS)  +  sh{T *  +  sS)  =  x\.  (3.51) 


Expanding  this  equation  in  the  limit  s  ->  0,  we  recover  v*(T*)  =  x\  by  matching 
leading  terms,  and  at  O(s)  the  perturbation  function  satisfies 

h(T*)  =  —x*(T*)S.  (3.52) 


Substituting  this  into  the  first  variation  (3.50),  the  boundary  term  reduces  to 


( 


a«^ 

dx' 


) 


fi=r* 


This  combination  of  terms  is  called  the  Legendre  transform  of  the  Lagrangian  with 
respect  to  the  state  variable.  It  will  be  convenient  to  define  a  new  function,  the 
Hamiltonian ,8  from  this  combination, 


J^  =  ^-x 


,d 
dx ’ 


(3.53) 


8Note  that  in  a  different  context,  there  is  another  definition  of  the  Hamiltonian  having  H  =  —Jtf. 
Despite  the  difference  in  sign  conventions,  both  are  called  Hamiltonians,  see  Exercise  3.7. 
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Since  j£f  =  L  —  \(xr  —  /),  we  find  that  the  Hamiltonian  is  given  by 


—  L  - h  Xf , 


(3.54) 


and  hence  the  boundary  term  becomes  \  S.  Imposing  the  critical  point  condition 
81*  =  0  for  all  independent  perturbations  (V/t,  Vv,  Vy,  V5),  we  obtain  the  four 
equations: 


dx 


d jsf 

—  =0,  —  =0,  Jf(T*)  =  0.  (3.55) 

du  d  X 


The  last  equation  provides  a  natural  final  condition  on  the  Hamiltonian.  Making  use 
of  the  fact  that  it  can  be  shown  that  if  ££  does  not  explicitly  (non-autonomously) 
depend  on  t ,  then  the  Hamiltonian  is  constant  (see  Exercise  3.7),  the  condition  at  /* 
determines  that  =  0  for  0  <  t  <  T*. 

The  third  equation  in  (3.55)  returns  the  state  equation  (3.44b).  Similarly,  the  first 
equation  yields  an  ordinary  differential  equation  for  the  evolution  of  the  Lagrange 
multiplier, 


( 


dL 

dx 


+ 


dX 


* 


dt 


(3.56) 


which  is  called  the  co-state  equation ,  with  k*(t)  (responsible  for  imposing  the  state- 
equation  constraint)  being  called  the  co-state  in  control  theory. 

Finally,  the  perturbation  with  respect  to  the  control,  (3.55)2  gives  a  geometric 
constraint  on  the  control  at  each  time  t  on  0  <  t  <  T*, 


dL  df 

— +A*-^  =0.  (3.57) 

du  du 

We  apply  the  above  derivation  to  a  simple  example  of  an  optimal  control  problem. 
Consider  the  initial  value  problem  for  x(t), 

dx 

—  =  —3x  +  u  v(0)  =  2. 
dt 

We  immediately  see  that  the  uncontrolled  system  would  have  the  exponentially 
decaying  solution,  x(t)  =  2e~3t .  As  an  illustrative  problem  where  the  influence 
of  control  is  crucial,  let  the  final  target  state  for  T  >0  be 


x(T)  =  5, 


which  is  unattainable  with  the  uncontrolled  solution.  Consider  minimising  a  control- 
based  cost  functional,  like  /energy  from  (3.46), 
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min  /  =  A  /  u2  dt. 
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We  can  directly  identify  the  Lagrangian  function  from  /  as  L  =  ^u2  and  the  rate 
function  from  the  state  equation  as/  =  —3 x  +  u.  Hence  the  augmented  Lagrangian 
is 

Jzf  =  f  u2  —  k(x  +  3x  —  u), 

and  the  Hamiltonian  is 

Jf?  =  ^u2  +  k(u  —  3x). 

From  the  Euler-Lagrange  equations  (3.55),  we  then  find 

dk  dx 

— 3 A  T  —  =  (),  u  k  —  0,  —  3x  —  u  =  0. 

dt  dt 

Starting  with  the  algebraic  relation,  we  can  eliminate  the  control  in  terms  of  the 
co-state,  u  =  —A.  The  co-state  equation  for  A  can  then  be  re-expressed  for  u  as 

du 

—  =3 u  =>  u(t)  =  Ae  , 

dt 

where  A  is  a  constant  of  integration.  With  this  expression  for  u(t ),  the  state  equation 
becomes 

—  =  -3x  +  Ae3'  =>•  x(t)  =  Be~3'  +  \Ae3' , 
dt  6 

where  B  is  a  second  constant  of  integration.  Substituting  x,  u  and  A  into  the  Hamil¬ 
tonian  yields 

^  =  3AB  =  0, 


so  that  either  A  or  B  is  zero.  The  choice  A  =  0  returns  the  uncontrolled  state, 
and  so  we  conclude  that  B  =  0  and  hence  x(t)  =  2e3t.  Illustrating  the  dramatic 
influence  of  the  control,  the  optimal  controlled  solution  is  exponentially  growing, 
whereas  the  uncontrolled  solution  is  exponentially  decaying.  The  optimal  stopping 
time  can  then  be  obtained  directly  from  the  optimal  solution  for  this  simple  problem 
as  ^  ln(5/2). 


3.9  Further  Directions 

There  is  an  extensive  literature  on  both  the  theory  and  the  applications  of  the  calcu¬ 
lus  of  variations.  Some  good  texts  with  additional  introductory  material  and  other 
advanced  topics  include  [64,  66,  71,  84,  104].  Further  texts  also  provide  more  rig¬ 
orous  analysis  [99] .  Presentations  of  the  applications  of  the  calculus  of  variations  in 
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mechanics  [40,  62,  67]  and  other  areas  of  applied  physics,  including  electrostatics 
and  quantum  mechanics,  [104]  leading  to  ODE  and  PDE  problems  can  also  be  found 
in  many  classic  engineering  and  physics  textbooks.  There  are  also  numerous  books 
that  present  constrained  optimisation  and  optimal  control  problems  as  extensions  of 
the  basic  method  of  the  calculus  of  variations  [59,  62,  66,  71,  84,  99]. 

While  the  introductory  presentation  in  this  chapter  as  focused  on  formulating  prob¬ 
lems  leading  to  ODEs,  some  of  the  exercises  will  show  that  PDEs  can  be  derived 
similarly  from  multiple  integrals.  More  challenging  problems  arise  when  assump¬ 
tions  on  the  smoothness  of  solutions  are  removed,  then  optimal  solutions  may  be 
composed  of  multiple  piecewise-smooth  sections  that  must  satisfy  appropriate  con¬ 
nection  conditions  at  transition  points. 


3.10  Exercises 

3.1  Consider  the  functional  for  y(v)  on  0  <  v  <  1, 

J=  f  j(/)2  +  -yy'  +X2ydx, 

Jo  * 


where  k  is  a  constant 

(a)  Determine  the  ODE  for  y*(v)  by  taking  the  variation  of  /. 

(b)  Show  that  the  ODE  can  be  also  obtained  directly  from  the  Euler-Lagrange  equa¬ 
tion  (3.21). 

(c)  For  k  =  0,  determine  y*(v)  satisfying  the  boundary  conditions  y(0)  =  1, 

y(D  =  i. 

(d)  For  k  =  2,  determine  y*(v)  satisfying  the  boundary  conditions  /(0)  =  1, 
y(l)  =  1.  Show  there  is  no  solution  that  satisfies  y(0)  =  1. 


3.2  ( The  second  derivative  test)  As  in  single-  and  multi- variable  calculus,  whether  a 
critical  point  is  a  local  maximum  or  minimum  can  be  determined  from  the  next  term 
in  the  local  expansion  of  the  objective  function.  Recall  the  problem  of  minimising 
the  arclength  of  the  curve  y  =  y(x),  (3.10a), 

J=  [  J 1  +  (/)2  dx,  y( 0)  =  0,  y(l)  =  b. 

Jo 


Setting  y(v)  =  y*(v)  +  sh(x)  and  expanding  /  to  0(s)  yielded  the  Euler-Lagrange 
equation. 


(a)  Continue  the  expansion  of  /  to  the  0(s 2)  term  to  obtain  the  second  variation , 
S2J. 


(b)  Show  that  for  this  problem  the  solution  y*  (x)  =  bx  is  indeed  a  local  minimiser 
by  showing  that  the  second  variation  is  positive  for  all  h(x)  ^0. 
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Fig.  3.5  Exercise  3.3:  An 
admissible  trial  solution  in 
the  form  of  a  parametric 
curve 


3.3  Consider  finding  the  shortest  path  between  two  points  from  among  all  possible 

smooth  parametric  curves  in  the  xy  plane  x(t)  =  for  0  <  t  <  T.  Let  the 

curve  start  at  (x,  y)  =  (0,  0)  at  t  =  0  and  end  at  (x,  y)  =  (1,  b)  at  t  =  T  for  T  fixed 
(see  Fig.  3.5).  The  functional  for  the  arclength  of  the  curve  is 

J(x,y)=  [  J (x')2  +  O')2  dt. 

■Jo 

Show  that  the  Euler-Lagrange  equations  for  x,  y  give  the  parametric  equations  of  the 
straight  line  found  as  the  solution  of  (3.10a,  3.10b). 

3.4  For  problems  where  the  distance  between  two  points  is  not  given  by  the  Euclid¬ 
ean  distance,  geodesics  are  curves  giving  the  shortest  path  between  two  points. 
Consider  the  problem  of  determining  a  geodesic  between  two  points  on  a  surface 
z  =  F(x,y). 

(a)  For  a  parametric  curve  (x(0,  y(t),  z(t ))  that  minimises  the  arclength, 

J (x,  y)  =  J  y (xr)2  +  (yO2  +  (z')2  dt, 

observe  that  the  resulting  Euler-Lagrange  equations  for  are  rather 

complicated,  even  if  the  surface  is  the  simple  paraboloid,  z  =  k(x2  +  y2). 

(b)  Show  that  even  if  we  restrict  the  solutions  to  be  functions,  with  y  =  y(x), 
the  Euler-Lagrange  equation  for  y(x)  on  the  paraboloid  is  still  a  challenging 
nonlinear  ODE  for  k  ^  0. 

(c)  Consider  the  problem  of  finding  the  geodesic  from  (x,y)  =  (— 1,  0)  to  (1,  0) 
on  the  paraboloid.  Determine  the  arclengths  of  the  following  trial  solutions  for 
k  >  0  (from  geometry  or  by  calculating  the  integral): 


76 


3  Variational  Principles 


(i)  the  path  along  the  v-axis, 

(ii)  the  semicircular  path  connecting  the  points. 

What  can  you  infer  about  the  geodesic  from  the  limits  k  ->  0  and  k  — >  oo  of  (i, 
ii)? 


3.5  The  Euler-Lagrange  equations  for  constrained  motion  can  be  obtained  by  start¬ 
ing  with  the  Lagrangian  for  general  unconstrained  motion  and  then  substituting-in 
the  parametric  equations  describing  the  geometric  constraint  (a  curve  or  surface) 
before  taking  variations. 


(a)  Consider  the  action  integral  for  two-dimensional  motion  of  a  particle  subject  to 
gravity 


x'(t)2  +y\tf 


mgy(t )  dt. 


Consider  a  particle  constrained  to  be  on  a  circle,  x 2  +y2  =  t1.  Derive  the  equation 
of  a  pendulum  by  first  plugging  the  parametric  equations  x(t )  =  i  sin  Q(t), 
y(t)  =  —i  cos  0(0  into  I  and  then  applying  the  principle  of  least  action  with 
respect  to  6{t). 

(b)  Repeat  (a)  for  the  motion  of  a  particle  constrained  to  the  curve  y  =f(x). 

(c)  Consider  the  action  integral  for  three-dimensional  motion  of  a  particle  subject 
to  gravity 


x'itf+y'itf  +  z'it)2 


mgzit)  dt 


Derive  the  equations  of  motion  for  a  particle  moving  on  the  surface  of  a  cone, 
x2  +  y2  =  z 2,  using  the  parametric  equations  x(t)  =  r(t)  cos  0(t),  y(t)  = 
r(t)  sin  0(0  and  z(t)  =  r(t). 

3.6  Consider  a  pendulum  whose  suspension  point  is  vertically  oscillated.  Let  the 
length  of  the  pendulum  be  i  and  its  mass  m.  The  acceleration  due  to  gravity,  g,  is 
acting  downward  in  the  —  y  direction.  If  the  suspension  point  is  oscillating  according 
to  y(t)  =  —a  sin  (cot),  then  the  position  of  the  pendulum’s  mass  is 

x(t)  =  i  sin  0  (0 ,  y (t)  =  —a  sin(&>0  —  t  cos  0  (t) , 


where  0  is  measured  from  the  —  y  axis. 

Write  the  Lagrangian  for  this  system  in  terms  of  0,  O'  and  obtain  the  Euler- 
Lagrange  equation  for  this  problem,  called  the  parametrically  driven  pendulum. 


3.7  Define  the  Hamiltonian,  H ,  in  terms  of  the  Lagrangian  L(t,y,y through  the 
Legendre  transform  as 


(3.58) 


(Note  the  opposite  sign  convention  relative  to  (3.53).) 
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(a)  If  L  is  independent  of  time  (i.e.  L  =  L(y,y'))  show  that  the  Hamiltonian  is 
constant.  (Hint:  show  that  dH /dt  =  0) 

Further,  show  that  for  this  case,  the  second-order  Euler-Lagrange  ODE  (3.21) 
is  equivalent  to  the  first  order  ODE, 

-L(y,/)+/^  =  C,  (3.59) 

dy/ 

where  C  is  a  constant  (=//);  this  reduction  of  the  Euler-Lagrange  equation  to 
a  first  order  equation  is  called  the  Beltrami  identity ,  and  is  a  special  case  of 
Noether’s  theorem  [62]. 

(b)  If  the  potential  energy  V  does  not  depend  on  /,  determine  the  most  general 
form  for  the  kinetic  energy  T(t,y,y ')  so  that  the  Hamiltonian  is  the  total  energy, 
H  =  T  +  V. 

3.8  ( The  brachistochrone )  Recall  the  problem  of  finding  the  curve  of  least-time 
descent  under  gravity  from  (x,  y)  =  (0,  1)  — >  (1,  0),  starting  from  rest  (Fig.  3.1  and 
Eq.  (3.6)).  We  now  complete  the  problem  as  follows: 

1  o 

(a)  Assuming  there  is  no  friction,  the  total  mechanical  energy,  E  =  ^mv  +  mgy , 
(where  v2  =  x'(t)2  +  y\t)2)  remains  constant,  set  by  its  initial  value.  Use  this 
to  determine  v(y)  and  complete  the  expression  of  the  functional  in  (3.6). 

(b)  Apply  (3.21)  to  obtain  the  Euler-Lagrange  second  order  ODE  for  y*(x). 

(c)  Noting  that  the  integrand  in  (3.6)  does  not  explicitly  depend  on x,  use  the  Beltrami 
identity  (3.59)  to  obtain  a  simpler  first  order  ODE  for  y*(v).  What  is  the  value 
of  the  constant  C  in  this  ODE? 

(d)  Show  that  the  solution  can  be  expressed  in  parametric  form  by  the  equations  of 
a  cycloid  with  k  >  0  (see  Fig.  3.6), 

x(0)  =  k(0  -  sin<9),  y(0)  =  1  -  k(l  -  cos  0).  (3.60) 

What  is  the  pair  of  equations  that  must  be  solved  numerically  to  determine  the 
value  of  k? 


Fig.  3.6  Exercise  3.8:  The 
solution  of  the 
brachistochrone  problem, 
given  by  the  cycloid  (3.60) 


0 


1 
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3.9  Consider  the  functional  for  y(x)  on  1  <  x  <2 


dx. 


Find  the  solution  y*(x)  that  minimises  J  and  satisfies  the  boundary  condition  y( 2)  = 
17.  What  is  the  natural  boundary  condition  on  y*  at  v  =  1? 

3.10  Obtain  the  curve  y  =  y*  (v)  on  0  <  v  <  b*  that  starts  at  the  origin,  ends  on  the 
curve  y  =  1  +  (v  —  l)2,  and  minimises  the  functional 


dx. 


3.11  ( Higher-order  Euler-Lagrange  equations )  Derive  the  Euler-Lagrange  equa¬ 
tion  for  the  functional 


rb 

J(y)=  /  L(x,y(x),y"(x))dx 


Describe  the  kinds  of  natural  boundary  conditions  that  are  needed. 
3.12  Consider  the  functional  for  y(x)  on  0  <  x  <  1, 


■  l  r 


dy  d?y 
dx  dx^ 


240 xy 


dx 


where  y(x)  satisfies  the  boundary  conditions 


y(0)  =  o,  /(l)  =  0,  y"(0)  =  0,  y(l)  =  5. 


(a)  Write  the  expression  for  the  first  variation,  8J. 

(b)  Show  that  the  critical  point  condition  can  be  reduced  to  an  ODE  boundary  value 
problem.  Justify  how  each  of  the  six  boundary  terms  in  8J  are  eliminated.  Solve 
the  ODE  problem  for  y *  (v) . 

3.13  Consider  the  functional  for  y(x)  on  0  <  x  <  1, 


C/)2  +  (1 


dx. 


(a)  Write  the  expression  for  the  first  variation,  8J . 

(b)  Write  the  four  possible  boundary  conditions  on  y(x)  (two  at  each  boundary) 
under  which,  the  critical  point  condition,  8J  =  0,  can  be  reduced  to  an  ODE  for 

y{x). 
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3.14  Consider  the  functional  for  y(x)  on  0  <  v  <  1, 


dx  +  /(0)/(l) 


with  y(x)  satisfying  the  boundary  conditions 


y(0)  =  2,  y(l)  =  5, 


Determine  the  ODE  boundary  value  problem  for  solutions  that  minimise  or  max¬ 
imise  /. 

3.15  Fermat’s  principle  of  least  time  states  that  a  beam  of  light  will  take  a  path 
that  minimises  its  time  of  travel.  The  index  of  refraction,  n,  of  a  material  gives  the 
ratio  of  the  speed  of  light  in  vacuum  to  the  slowed  speed  of  light  in  the  material, 
n  =  c/v  >  1. 

Consider  a  layer  of  glass  on  0  <  r  <  1  whose  index  of  refraction  varies  with 
position,  n  =  n(x).  Let  y(v)  describe  the  path  of  a  beam  of  light  entering  the  layer  at 
v  =  0  at  a  45°  angle,  / (0)  =  1  (see  Fig.  3.7  (left)). 

(a)  Making  use  of  functional  (3.6),  derive  the  second-order  ODE  problem  for  y(x) 
and  show  it  can  be  reduced  to  a  first-order  equation. 

(b)  If  n(x)  is  piecewise  constant,  n\  for  v  <  ^  and  n2  for  v  >  show  that  part  (a) 
reduces  to  Snell’s  law  [91]  (see  Fig.  3.7  (right)). 

(c)  What  is  the  form  of  the  Euler-Lagrange  equation  if  n  =  n(x,y)l 


3.16  (The  beam  equation)  Hamilton’s  principle  of  least  action  can  be  applied  to 
derive  the  time-dependent  partial  differential  equation  for  the  transverse  deflections 
y  =  u(x,  t)  of  a  flexible  solid  rod  or  beam.  The  problem  for  a  one-dimensional  beam 
is  characterised  by  the  following  properties: 


Fig.  3.7  Exercise  3.15:  (Left)  diffraction  of  the  path  of  a  ray  of  light  due  the  change  of  the  index 
of  refraction  with  position,  n  —  n(x),  (Right)  The  special  case  when  n  is  piecewise  constant 
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•  The  length  of  the  beam  is  i  (domain:  0  <  x  <  £)  with  constant  mass  density  p 
(mass  per  unit  length),  constant  bending  stiffness  E  (like  a  spring  constant)  and 
constant  moment  of  inertia  I . 

•  The  overall  total  kinetic  and  potential  energies  of  the  beam  are  given  by  integrals 
over  the  length  of  the  beam  of  the  corresponding  energy  density  functions,  e(u ), 

(o  =  Jq  e(u )  dx. 

•  The  kinetic  energy  density  is  \ p(dtu )2. 

•  The  potential  energy  density  due  to  bending  (i.e.  transverse  deflection  or  buckling) 
is  jEIidxxU)2. 

Follow  these  steps  to  derive  the  beam  equation: 

(a)  Write  the  integrals  for  the  total  kinetic  T  and  potential  V  energies  of  the  beam. 

(b)  Write  the  action  integral  J(u).  (Hint:  /  is  a  double  integral) 

(c)  Apply  the  principle  of  least  action  to  derive  the  beam  equation  for  u(x,  t)  and 
state  the  choices  of  natural  boundary  conditions  for  u  at  v  =  0  and  x  =  l.  Two 
boundary  conditions  (out  of  2  sets  of  2  options  for  natural  boundary  conditions) 
are  needed  at  each  boundary. 

3.17  For  a  given  positive  function  k(x,  y)  >  0,  consider  the  integral  for  the  function 
u(x,  y)  on  a  finite  two-dimensional  domain  D , 


J(u)  = 


Derive  the  partial  differential  equation  for  the  solution  u(x,  y)  and  two  appropriate 
forms  of  natural  boundary  conditions  on  u(x,  y )  that  produce  minima  of  J(u). 

If  k  is  a  constant,  show  that  the  PDE  reduces  to  Laplace’s  equation. 

Hint:  Recall  the  vector  version  of  the  derivative  product  (for  the  divergence  of  a 
scalar  times  a  vector),  V  •  (f  g)  =  (V/)  •  g  +/(V  •  g),  and  then  make  use  of  the 
divergence  theorem. 

3.18  How  does  the  problem  of  minimising  the  arclength  of  a  non-negative  function 
y(x)  >  0  on  0  <  v  <  1  that  encloses  a  fixed  given  area  A  =  Jq  ydx  relate  to  the 
isoperimetric  example  given  in  Sect.  3.6? 

3.19  Find  the  solution  y(x)  on  0  <  x  <  tt  that  minimises 

r  ,  o  r  2 

J(x)  =  /  1  +  (y  )  dx  subject  to  the  constraint  /  yA  dx  =  80 

Jo  Jo 

and  boundary  conditions  y(0)  =0  and  y(7r)  =  1. 

(a)  Write  the  augmented  Lagrangian  Jzf  for  this  problem.  Determine  the  Euler- 
Lagrange  problem  and  solve  for  y(x,  X)  subject  to  the  boundary  conditions, 
where  X  is  the  Lagrange  multiplier. 
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(b)  Evaluate  the  integral,  /(A)  =  f  y2  dx ,  as  a  function  of  A.  Plot  this  function  and 
estimate  the  values  of  A  that  yield  solutions.  If  the  integral  constraint  is  required 
to  equal  one  instead  of  80,  what  happens  to  the  solutions?  What  is  the  minimum 
value  of  I  for  which  solutions  exist? 

3.20  (The  classic  isoperimetric  problem)  This  problem  will  lead  you  through  deriv¬ 
ing  that  the  circle  is  the  closed  curve  of  fixed  perimeter  P  that  encloses  the  maximum 
area: 

Let  x(t),  y(t)  be  the  parametric  equations  for  a  closed  curve  with  0  <  t  <  1  that  goes 
through  the  origin: 


*(0)=*(1)  =  0,  y(0)  =  y(l)  =  0. 


(a)  Use  Green’s  theorem  to  show  that  the  area  enclosed  by  the  curve  is  given  by 


1 


\x(t)y'(t)  -  y(t)x'(t )]  dt. 


Show  that  the  augmented  objective  function 


J?(x,y,  X)  =  Uxy'  -yx)  -  X 


y (x')2  +  (y')2  -  P 


with  J  =  fj  Jzf  dt ,  defines  the  problem  of  maximising  the  area  for  a  closed  curve 
with  perimeter  P. 

(b)  Obtain  the  Euler-Lagrange  equations  for  x(t)  and  y(t). 

(c)  Integrate  each  once  with  respect  to  t  and  show  that  they  can  be  combined  to  yield 
the  equation  for  a  circle  with  the  radius  given  in  terms  of  |A|. 

(d)  Determine  A  so  that  the  perimeter  constraint  is  satisfied.  Determine  the  possible 
positions  for  the  centre  of  the  circle. 

3.21  (Sturm- Liouville  eigenvalue  problems)  For  given  functions  p(x)  >  0  and  q(x ), 

show  that  the  functional  for  y(x) 


1=  [  p(x)(y')2  -  q(x)y2  dx 

Jo 

yields  the  Euler-Lagrange  equation 

d  (  dy\ 

—  I  p(x)—  )  +q{x)y  =  0. 
dx  \  dx  J 

For  homogeneous  boundary  conditions  on  y(x),  the  trivial  solution  y(v)  =  0,  is  a 
critical  point  for  all  p,  q.  Show  that  seeking  nontrivial  critical  points  of  I  satisfying 
the  normalisation  condition, 

l 

y  cr(x)  dx  =  1 , 
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for  a  given  weight  function  a  (v)  >  0,  yields  the  Sturm-Liouville  equation 


d  (  dy\ 

,  I  p(x)  —  1  +  (q(x)  +  Xa (x))y  =  0, 
dx  \  dx  J 


where  A  are  eigenvalues  and  y(x)  are  the  corresponding  eigenfunctions. 

3.22  ( The  pendulum  revisited )  Consider  again,  the  action  integral  for  motion  of  a 
point  mass  in  two  dimensions  subject  to  gravity  (recall  Exercise  3.5(a)), 


x\t)2  +  y\t)2 


mgy(t)  dt. 


Suppose  that  the  mass  is  constrained  to  move  on  the  circle,  x2  +  y2  =  i2.  Consider 
this  as  a  problem  with  a  holonomic  constraint.  Write  the  Euler-Lagrange  equations 
for  x(t ),  y(t).  Note  that  the  Lagrange  multiplier  appearing  in  these  equations  can  be 
interpreted  as  a  force  required  to  keep  the  mass  on  the  circle.  Show  that  the  equations 
can  be  reduced  to  the  pendulum  equation  for  0  ( t ) . 

3.23  Consider  the  problem  of  finding  a  locally  optimal  solution  of  the  functional 


dx 


subject  to  the  conditions  that 


y(2)  =  j(l)  +  3,  J  24xy  dx  =  5. 


(a)  Write  the  augmented  functional  for  the  constrained  optimisation  problem. 

(b)  Determine  the  natural  boundary  condition  that  allows  the  critical  point  condition 
to  be  reduced  to  an  ODE  problem. 

(c)  Write  the  general  solution  of  the  ODE  (homogeneous  and  particular  terms)  and 
the  system  of  equations  to  determine  the  three  constants  in  your  solution. 

3.24  Find  the  locally  optimum  solution  of  the  functional 


(/)2 

^3 


dx 


subject  to  the  conditions  that 


y  dx  =  —3. 


y(l)  =  4,  y(2)  =  —10, 


2 
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3.25  The  Pontryagin  Maximum  Principle  (PMP)9  is  a  classic  result  from  optimal 
control  theory.  For  the  classic  optimal  control  problem  (3.44)  the  PMP  states  that 
the  equations  for  the  optimal  solution  can  be  very  concisely  given  in  terms  of  the 
Hamiltonian,  =  L  +  Xf, 

dx  d^f  dX  dJ^  dJ^ 

—  = - ,  = - ,  - =0, 

dt  dX  dt  dx  du 

Subject  to  =  0.  Show  that  these  equations  reproduce  (3.55)  and  these  equa¬ 

tions  are  consistent  with  the  Hamiltonian  being  a  constant  for  all  times.  (Hint:  Apply 
the  chain  rule  to  evaluate  dJ^f/dt) 

3.26  Determine  the  solution  x(t)  and  the  control  function  u(t)  that  satisfy  the  state 
equation 

dx 

— -  =  3x  +  u  0  <  t  <  T 
dt 

with  initial  and  final  conditions 


*(0)  =  2,  *(r)  =  l, 


while  minimising  the  cost  functional 


3 xu  H-  u 


dt. 


(a)  Use  the  Pontryagin  principle  from  Exercise  3.25  for  the  case  where  the  final  time 
is  the  optimal  stopping  time  T  =  T*. 

(b)  If  instead  the  final  time  is  specified  as  T  =  1  /4,  what  is  the  optimal  solution? 
What  is  the  value  of  the  Hamiltonian? 

3.27  ( The  brachistochrone  revisited)  Recall  Exercise  3.5(b),  where  we  determined 
that  the  equation  of  motion  for  the  horizontal  position  x(t)  of  a  particle  sliding  down 
a  ramp  y  =  fix)  under  the  influence  of  gravity  is 


d 

dt 


O' ) 


-gf\x)  +f(x)f"(x) 


Use  this  result  applied  to  functions  satisfying  the  boundary  conditions  /( 0)  =  1 
and/(l)  =  0  to  show  that  the  brachistochrone  problem  can  be  expressed  as  an 
optimal  control  problem  on  /  subject  to  minimising  travel  time  T  for  a  particle 
satisfying  v(0)  =  0  and  x(T)  =  1.  Write  a  Lagrangian  analogous  to  (3.47)  and  the 
corresponding  functional  to  determine  the  Euler-Lagrange  problem. 


9 Sometimes  called  the  Pontryagin  minimum  principle,  depending  on  the  choice  of  sign  convention 
used  for  H  versus  recall  page  7 1 . 


Chapter  4 

Dimensional  Scaling  Analysis 


“ Back  of  the  envelope  calculations  ”  are  simple  algebraic  calculations  that  can  fit 
into  a  small  space  but  provide  substantial  guidance  on  understanding  problems.  Such 
calculations  are  often  based  on  dimensional  analysis ,  which  identifies  the  relations 
between  system  properties  that  must  be  present  in  a  problem  due  to  the  fundamental 
types  of  the  properties  being  considered.  It  is  valuable  to  understand  that  insight 
on  how  a  system  can  be  expected  to  behave  can  be  obtained  from  interpretation  of 
appropriate  products  of  the  parameters  characterising  the  problem. 

One  simple  type  of  dimensional  analysis  result,  sometimes  called  a  Fermi  estimate , 
provides  predictions  based  on  products  of  dimensional  rates,  as  in  the  number  of 
widgets  produced  in  a  year  =  (number  of  widgets  produced  per  day)  x  (number 
of  work-days  per  week)  x  (average  number  of  weeks  per  month)  x  (months  per 
year).  This  result  does  not  take  account  of  the  details  of  the  problem,  but  gives  an 
order  of  magnitude  estimate  of  the  expected  number.  Dimensional  analysis  can  not 
replace  detailed  solutions  of  problems,  but  it  can  provide  a  guide  for  comparison 
with  expectations  and  identification  of  important  influences  in  problems. 

In  this  chapter,  our  focus  will  be  on  the  use  of  dimensional  analysis  to  nondimen- 
sionalise  entire  problems  in  order  to  provide  simplified  re- statements  of  the  problems 
that  identify  a  reduced  (minimal)  number  of  essential  parameters  that  solutions  should 
depend  on.  We  will  illustrate  the  use  of  dimensional  analysis  through  its  application 
to  a  set  of  examples  before  discussing  the  Buckingham  Pi  theorem,  which  gives  a 
mathematical  framework  to  support  the  results  of  the  elementary  scaling  calculations. 


4.1  Dimensional  Quantities 

Every  physical  quantity  Q  can  be  expressed  as  the  product  of  a  dimensional  unit , 
denoted  here  by  [Q]  (describing  the  physical  nature  of  a  single  ‘portion’  or  unit  of  the 
quantity),  and  a  magnitude  q  (the  number  of  portions),  i.e.  Q  =  g[Q].  For  example, 
mass  can  be  measured  in  dimensional  units  of  kilograms,  weight  in  pounds,  volume 
in  litres,  data  in  gigabytes,  frequency  in  megahertz,  length  in  meters,  time  in  seconds, 
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temperature  in  degrees  Kelvin,  and  so  on.  These  choices  are  not  unique:  Q  =  1  mile 
=  1,760yd  =  5,280ft  =  63,360in  =  160,934.4cm  =  1.7011  x  10“13  light  years; 
all  of  these  are  equivalent  descriptions  of  the  same  length  with  respect  to  different 
units  of  measure  and  different  corresponding  magnitudes. 

The  choice  of  dimensional  units  for  measurements  should  be  appropriate  to  the 
problem  under  consideration.  For  example,  it  would  be  very  inconvenient  to  measure 
interplanetary  distances  in  feet  (the  magnitudes  involved  being  literally  astronomi¬ 
cally  large)  or  to  record  the  size  of  a  bacterial  swarm  in  kilometers.  Having  to  deal 
with  very  large  (or  very  small)  numbers  is  not  only  cumbersome,  but  the  analysis  in 
some  later  chapters  will  crucially  rely  on  separating  small  from  large  quantities  and 
a  bad  choice  of  measurement  scales  can  obscure  this  process. 


4.1.1  The  SI  System  of  Base  Units 

In  many  everyday  physical  problems,  it  is  reasonable  to  employ  dimensional  units 
of  meters  for  displacement  ([Length]  =  meters),  kilograms  for  mass  ([Mass]  = 
kilograms),  and  seconds  for  time  ([Time]  =  seconds).  This  is  a  standard  set  of  units J 
forming  part  of  what  is  known  as  the  International  System  (SI)  of  Base  Units  used  for 
quantifying  distinct  physical  properties.  The  seven  fundamental  (base)  dimensional 
units  that  make  up  the  SI  system  are  [97] 

[Length]  =  meter,  [Time]  =  second,  [Mass]  =  kilogram,  (4.1) 

[Temperature]  =  Kelvin,  [Electric  current]  =  ampere, 

[Light  intensity]  =  candela,  [Material  quantity]  =  mole. 

These  dimensional  units  ar z  fundamental  in  the  sense  that  they  describe  independent 
physical  properties  that  cannot  be  represented  in  terms  of  each  other.  Dimensional 
units  that  can  be  expressed  in  terms  of  the  fundamental  units  are  known  as  derived 
units.  For  example,  [speed]  =  meter/second,  [acceleration]  =  meter/second2,  and 
[force]  =  kilogram- meter/second2.  The  most  commonly  occurring  derived  units  are 
given  their  own  names  for  convenience,  such  as  [force]  =  newton,  [pressure]  = 
pascal,  and  [energy]  =  joule.  For  some  problems,  it  can  be  convenient  to  express 
quantities  with  respect  to  different  sets  of  base  units,  those  of  easily-measurable 
properties,  for  example,  in  terms  of  the  pressure  of  a  gas,  [force]  =  [pressure]  [area] . 

Derived  dimensional  units  are  products  of  powers  the  fundamental  units 

[Q]  =  [Length]0  [Time]^  [Mass]7  .... 

This  result  also  extends  to  products  of  quantities,  [QyCL  |  =  [Qi]2[Q2]. 


1  Also  called  the  MKS  system  from  Meters,  Kilograms,  Seconds. 
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In  what  follows,  in  order  to  emphasise  that  it  is  the  nature  of  a  quantity  (length, 
mass,  time  and  so  on)  that  we  are  interested  in  rather  than  its  representation  in  a 
particular  system  of  base  units,  we  will  simply  talk  about  the  quantity’s  dimensions 
instead  of  explicitly  referring  to  dimensional  units. 


4.2  Dimensional  Homogeneity 

The  principle  of  dimensional  homogeneity  states  the  intuitive  result  that  all  terms 
summed  to  yield  an  equation  modelling  a  system  must  have  the  same  dimensions. 
For  example,  sums  of  forces  in  Newton’s  second  law,  or  sums  of  electrical  currents 
in  Kirichoff’s  law,  or  sums  and  differences  of  amounts  of  money  in  accounting 
calculations.  In  contrast,  it  not  valid  to  add  a  velocity  to  an  acceleration,  or  a  time  to 
a  mass. 

An  immediate  consequence  is  that  any  complicated  (non-monomial)  function, 
such  as  sin,  exp,  tan,  log,  and  generally  any  /(X)  must  have  dependent- 

arguments  (variables)  that  are  dimensionless.  Namely  the  dimensional  units  in  the 
variables  must  cancel-out  identically.  To  see  why  this  must  be  true,  consider  writing 
the  exponential  function  in  terms  of  its  Taylor  series  expansion 

ex  =  1  +  X  +  iX2  +  •  •  •  +  -X"  +  •  •  •  . 

z  n\ 

If  X  was  not  dimensionless,  then  each  term  in  the  Taylor  expansion  would  have 
different  dimensions  thereby  violating  the  principle  of  dimensional  homogeneity; 
here  all  terms  must  have  the  same  units  as  the  first  term  in  the  expansion,  [1]  which 
is  dimensionless.  We  note  that  variables  representing  angles  are  dimensionless,  as  in 
the  formula  for  the  arclength  of  circle,  S  =  R0,  [length]  =  [length]  [dimensionless] . 

As  an  illustrative  example  of  dimensional  homogeneity,  consider  the  motion  of  a 
vertically  launched  projectile.  If  we  denote  Y  as  the  projectile’s  height  (with  initial 
height  Yo),  T  as  the  time  passed,  Vo  as  the  initial  velocity,  and  Ao  as  a  constant 
acceleration  (typically  gravity),  the  equation  of  motion  for  the  projectile  is  simply  a 
quadratic  form  in  T 

Y  =  Yo  +  V0T  +  iA0T2. 

Expressed  in  terms  of  its  dimensions,  we  see  that  each  term  has  the  same  dimensions, 
namely 


[Length]  =  [Length]  + 


Length 

Time 


[Time]  + 


Length 

Time2 


[Time2]. 
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4.3  The  Process  of  Nondimensionalisation 

The  importance  of  dimensional  homogeneity  lies  in  the  realisation  that  as  each  term 
in  an  equation  must  have  the  same  dimensions,  we  can  scale  all  dependent  and 
independent  variables  by  dimensional  constants  to  yield  dimensionless  equations. 
Effectively,  this  means  that  the  dimensional  units  have  been  factored  out  of  the  orig¬ 
inal  problem,  leaving  a  “clean”  mathematical  system  in  terms  of  only  dimensionless 
magnitudes  and  dimensionless  parameters. 

The  dimensions  of  the  scaling  constants  are  predetermined  by  dimensional  homo¬ 
geneity,  but  we  have  the  freedom  to  select  their  magnitudes  in  order  to  obtain  ‘con¬ 
venient  forms’  of  the  nondimensional  model  that  will  be  better- suited  for  further 
mathematical  analysis. 

In  the  following  examples,  we  will  introduce  standard  conventions  on  how  to 
select  dimensionless  parameters  as  well  as  highlighting  further  consequences  of 
dimensional  analysis. 


4.3.1  Projectile  Motion 


Consider  a  projectile  of  mass  M  kilograms  that  is  launched  vertically  with  initial 
speed  Vo  m/s,  from  a  position  Yo  meters  above  the  surface  of  the  Earth.  Newton’s 
universal  law  of  gravitation  coupled  with  the  second  law  of  motion  then  gives  that  the 
height  of  the  projectile  Y  (T)  varies  with  time  T  according  to  the  ordinary  differential 
equation 


.  ,d2Y  GMeM 

M - = - - - 

dT2  (Re  +  Y)2’ 


Y(0)  =  Y0, 


Y'(0)  =  V0, 


(4.2) 


where  the  Earth’s  properties  (mass  and  radius)  are  ME  =  6  x  1024kg  and  Re  = 
6.4  x  106m,  and  the  universal  gravitational  constant  G  =  6.7  x  10  11m3/(s2kg). 
We  begin  by  noting  that  the  parameters  G  and  Me  only  occur  as  a  product  and  so 
can  only  appear  in  the  solution  in  the  same  form.  Consequently,  we  can  regard  this 
combination  as  a  single  parameter  and  replace  it  using  g  =  GMe/R|  ~  9.81  m/s2, 
this  being  the  familiar  value  for  acceleration  due  to  gravity  at  the  Earth’s  surface. 
The  problem  then  takes  the  form 


r/2  y  gR|M 

dT2  (Re  +  Y)2’ 


Y(0)  =  Y0,  Y'(0)  =  V0. 


(4.3) 


We  now  introduce  the  (as  yet  arbitrary)  length  and  time  scales,  L  and  T, 


Y(T)  =  L  y(t),  T  =  T  t. 


(4.4) 
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so  that  y  and  t  are  dimensionless  variables.  The  scaling  constants  (L  and  T)  for  the 
different  independent  dimensional  quantities  in  a  problem  (here,  length  and  time) 
are  called  the  characteristic  scales. 

Substituting  into  (4.3)  yields  the  rescaled  problem 


L  d2y 
T2  dt2 


MO)  =  Y0, 


!=/(0)  =  V0. 


Dividing  through  each  expression  by  the  dimensional  coefficient  of  the  first  term  on 
the  left-hand-side  yields  the  nondimensionalised  problem 


(4.5) 

where  the  77/  are  nondimensional  constants.  It  is  important  to  realise  that  such  dimen¬ 
sionless  parameters  always  represent  the  ratio  of  two  competing  effects.  For  example, 
77i  represents  the  relative  importance  of  gravitational  acceleration  versus  the  accel¬ 
eration  of  the  projectile,  77i  =  g/(L/T2).  Likewise,  TJ2  compares  the  length  scale 
of  interest  to  the  Earth’s  radius,  772  =  L/Re- 

We  have  the  freedom  to  choose  the  length  and  time  scales  L  and  T  to  scale  the 
problem  into  a  desired  form;  here  we  describe  one  such  form.  Selecting  the  length 
scale  L  =  Yo  sets  773  =  1,  while  choosing  a  timescale  based  on  acceleration  due 
to  gravity,  T  =  (L/g)1/2,  fixes  TJ\  =  1.  Through  (4.4),  the  selection  of  L  and  T 
gives  the  respective  units  of  measure  that  all  lengths  and  times  in  the  problem  will 
be  calibrated  against.  Characteristics  scales  given  directly  by  a  single  given  quantity 
from  the  original  problem  (here  L  =  Yo)  are  sometimes  called  imposed  scales.  Scales 
obtained  from  combinations  of  given  quantities  (here  T  in  terms  of  Yo  and  g)  are 
called  derived  scales. 

We  note  that  there  are,  in  fact,  many  possible  scaling  choices  as  we  could,  for 
example,  multiply  either  (or  both)  of  the  length  or  time  scales  by  some  function 
/( Yo/Re)  and  the  resulting  equation  would  still  be  nondimensional.  However,  it 
turns  out  that  the  normalised  forms  are  often  the  most  convenient  choices. 

We  have  employed  what  we  shall  refer  to  as 


The  1st  general  nondimensional  scaling  principle: 

Select  the  characteristic  scales  so  that  as  many  as 
possible  of  the  77 ’s  are  normalised. 


(4.6) 
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For  the  current  example,  the  nondimensionalised  model  takes  the  simplified  form 


d2y  =  1 

dt2  (l  +  n2y)2’ 


y(0)  =  1,  y(0)  =  774. 


(4.7a) 


with 


n2  = 


L  Y 


o 


Re  Re’ 


n4  = 


VqT  _  Vp 

l  (gY0)1/2 


(4.7b) 


The  solution  of  (4.7a)  will  be  a  function  of  the  independent  variable  t  and  the  remain¬ 
ing  nondimensional  constants  772  and  TI4,  namely 


y  =  y(t,  n2,  774), 


or  equivalently, 

Y  =  Y 0y(f,  n2, 774),  T  =  (Y0/g)1/2f, 

in  contrast  to  the  solution  of  the  original  problem  (4.3)  which  takes  the  general  form 


Y  =  Y(T,  g,  Re,  Y0,  V0). 


We  have  effectively  reduced  the  complexity  of  the  original  problem  (involving  a 
solution  function  of  five  variables)  to  a  solution  depending  on  only  three  variables. 

As  will  be  discussed  further,  nondimensionalization  allows  us  to  examine  different 
limiting  forms  of  problems  in  a  mathematically  precise  framework.  Consider  the 
limit  772  =  Yo/Re  — >  0  in  the  model  (4.7).  At  first  sight,  this  limit  may  appear 
to  describe  increasing  the  radius  of  the  Earth  (Re  — >  00),  or  decreasing  the  initial 
height  (Yo  — >  0),  or  both.  For  the  limit  Re  — >  00  (with  Yo  fixed),  we  can  formally 
set  772  =  0  to  reduce  (4.7a)  to  a  simpler  ODE  that  can  be  easily  solved, 

d2y  1  9 

—p  =  -1  =4>  y(t)  =  1  +  774t  -  \t2. 

This  solution  can  be  interpreted  as  a  first  approximation  to  the  motion  of  a  projectile 
that  is  launched  relatively  close  to  the  surface  of  a  large-radius  planet  with  the  same 
gravity  as  the  Earth.  Undoing  the  nondimensional  scaling  (4.4)  converts  this  to  the 
dimensional  form  of  the  classic  quadratic  polynomial  for  projectile  motion, 

Y  =  Y0  +  V0T-  igT2. 

However  if  we  consider  the  limit  Yo  — >  0  (with  Re  fixed)  then  774  — >  00;  this  is  not 
acceptable  as  a  value  of  for  the  initial  condition  on  /(0)  and  forces  us  to  question 
whether  we  have  selected  an  appropriate  choice  of  characteristic  scales. 

In  fact,  if  scalings  yield  any  undefined  terms  in  a  non-dimensionalized  model, 
then  that  model  in  that  form  will  not  be  solvable.  This  points  to  the  need  to  change 
the  scalings  to  put  the  model  into  a  well-defined  form,  which  we  express  as 
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The  2nd  general  nondimensional  scaling  principle: 

Select  characteristic  scales  so  that  NO  terms  in 
the  model  diverge  in  the  physical  limit  of  interest. 


(4.8) 


The  choice  of  scales  is  determined  once  the  sub-set  of  77’s  to  be  normalised  is 
made;  hence  other  choices  correspond  to  picking  different  77’ s.  For  problems  with  k 
characteristic  scales,  the  scales  will  be  determined  from  normalising  k  77’s;  this  will 
be  discussed  further  in  connection  with  the  Buckingham  Pi  theorem,  see  Sect.  4.5. 

To  examine  the  Yo  — >  0  limit,  consider  setting  77i  =  1  and  774  =  1  in  (4.5).  The 
length  and  time  scales  are  then  L  =  V^/g  and  T  =  Vo/g.  The  revised  form  of  the 
nondimensionalised  problem  is 


(1  +  fl2y)2 ' 

3 

II 

/(0)  =  1. 

L  M2 

Re  9Re’ 

773  =  Y° 
L 

II 

<  ^ 

oto  g 

(4.9a) 


(4.9b) 


This  choice  of  scalings  satisfies  (4.8)  since  setting  Y0  =  0  sets  773  =  0  but  does  not 
cause  any  other  parameters  in  the  problem  to  diverge, 


<fy  _  l 

dt 2  (\+n2y)r 


y(  0)  =  o,  /(0)  =  i. 


If  we  further  specify  that  Re  ->  oo  then  772  =  0  and  the  ODE  reduces  to  y"  =  —  1 
yielding  the  solution 

y(t)  =  -\t2  +  t  =►  Y  =  V0T-igT2. 

For  V  o  >  0  the  time  of  flight  of  the  projectile  is  clearly  seen  to  be  2Vo/g  while  the 
maximum  height  reached  is  Vg/(2g),  both  of  which  are  directly  proportional  to  the 
scales  that  we  chose. 


4.3.2  Terminal  Velocity  of  a  Falling  Sphere  in  a  Fluid 

We  now  consider  further  aspects  of  scaling  in  nondimensionalizing  another  example 
from  physics — describing  the  motion  of  a  solid  ball  with  a  given  initial  velocity 
moving  vertically  downward  through  a  viscous  fluid. 

The  ball  has  radius  R  m  and  uniform  density  p  kg/m3  (so  that  the  mass  of  the  ball 
is  given  by  M  =  ^7rR3p).  The  initial  speed  of  the  ball  is  taken  to  be  Vq  m/s.  The 
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fluid  has  a  different  density,  p /  and  its  viscosity  is  p  kg/m  s,  which  gives  a  measure 
of  the  frictional  resistance  to  the  ball  moving  through  the  fluid. 

Noting  that  the  forces  acting  on  the  ball  are  due  to  its  weight,  buoyancy  and  the 
drag  of  the  fluid,  we  can  use  Newton’s  second  law  to  write  down  a  force  balance 

d\/ 

=  Mg  —  Fbuoy  —  F  drag ,  V(0)  =  Vo, 

where  we  have  expressed  the  acceleration  as  the  derivative  of  the  ball’s  velocity.  The 
magnitude  of  the  buoyancy  force  is  given  by  the  weight  of  the  fluid  displaced  by  the 
ball,  FkUoy  =  ^7rR3/9/g,  while  the  frictional  force  acting  on  a  sphere  is  given  by 
Fbrag  =  67r/iRV,  a  classic  result  derived  by  Stokes. 

Consequently,  the  problem  can  be  restated  as 

4  o  dM  4  o  4  o 

-7rR3p—  =  -7rR3pg  —  -ttR3  ppg  —  67r/iRV.  (4.10) 

3  cr f  3  3 

Introducing  the  nondimensionalization,  V(T)  =  Vv(t)  with  T  =  Tt,  where  V  and  T 
are  characteristic  scales  for  speed  and  time  yields  the  nondimensional  model 


dv  ( (p-  Pf)  qt 


dt 


( 


Pv 


)-©> 


v,  v(0)  = 


n2 


n3 


(4.11) 


Setting  Fl\  =  1  and  =  1  yields  the  characteristic  scales 


V  =  V0,  T  = 


and  the  nondimensionalized  problem  reads: 


pV  o 


ip-  Pf)  g’ 


(4.12) 


dv 

dt 


=  1  —  St  v,  v(0)  =  1, 


where  the  772  group  has  been  renamed  the  Stokes  parameter 


St  = 


9/iVo 


2(p  -  p/)gR2 


(4.13) 


(4.14) 


The  “Stokes  parameter”  name  attached  to  772  is  a  historical  label  used  for  this 
dimensionless  parameter,  but  this  classification  has  important  value  since  it  makes 
it  easier  to  search  for  other  results  on  related  problems  involving  this  parameter  in 
books,  journals,  and  other  sources.  Hence  dimensionless  parameters  make  it  possi¬ 
ble  to  universally  compare  results  from  different  studies  (experiments,  simulations, 
and  theory)  and  communicate  in  a  common  terminology  across  many  branches  of 
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Table  4.1  A  few  commonly  used  dimensionless  parameters 


Name 

Formula 

Competing  effects 

Area  of  study 

Arrhenius 

E/(  RT) 

Activation/potential  energy 

Thermodynamics 

Damkohler 

kL/(UCo) 

Reaction/transport  rates 

Thermodynamics 

Lewis 

Dt/Dm 

Thermal/mass  diffusivity 

Thermodynamic  s 

Mach 

V/c 

Char,  speed/sound  speed 

Aerodynamics 

Peclet 

UL/D 

Convection/diffusion 

Thermodynamics 

Reynolds 

pUL//r 

Inertia/viscosity 

Fluid  dynamics 

Stokes 

(4.14) 

Drag/gravity 

Fluid  dynamics 

science  and  engineering.  See  Table  4.1  for  a  short  list  of  some  of  the  many  other 
named  parameters,  also  see  [69]. 

The  nondimensionalized  problem  (4.13)  has  an  explicit  solution  which  conse¬ 
quently  depends  only  on  t  and  the  single  St  parameter: 


The  solution  of  the  original  dimensional  problem  can  then  be  written  in  terms  of 
(4.15)  and  (4.12)  as 

V(T)  =  V0v(f/T,  St) 

Vo  ,  w  I  \  (  vAp-pf) g.\ 

=  si  +  Vo(1_^jexprS 

=  +  v°<T^FPr.  (4.16) 

The  last  expression  is  what  would  have  been  obtained  from  solving  (4.10)  directly. 
As  should  be  expected,  its  functional  form  is  equivalent  to  (4.15),  but  (4.16)  makes 
it  difficult  to  see  that  the  solution  is  actually  only  dependent  on  a  single  parameter. 
Having  (4.13)  in  a  less  cumbersome  form  than  (4.10)  facilitates  using  it  to  identify 
the  steady  state  as  v  =  1  /St;  this  represents  the  “terminal”  free-fall  velocity  in  this 
problem  (Vo/St  =  2 (p  —  p/)gR2/(9/i)  in  dimensional  form). 

Here,  as  in  most  problems,  the  dimensionless  parameters  more  efficiently  and 
compactly  capture  the  dependence  of  key  properties  of  the  system  on  its  design  para¬ 
meters.  In  addition,  if  this  model  had  been  compared  against  experimental  data  for 
V  =  V(T,  R,  Vo,  pf,  p,  n,  g)  then  while  the  data  could  involve  many  experiments 
varying  the  six  parameters,  Eq.  (4.16)  shows  that  all  of  that  data  could  be  captured 
by  the  two  characteristic  scales  and  the  Stokes  parameter.  Dramatic  reduction  in 
re-organising  data  to  show  its  underlying  fundamental  structure  is  often  called  col¬ 
lapsing  of  data,  namely  when  many  data  points  from  different  runs  are  all  shown  to 
fall  on  universal  curves. 
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In  (4.1 1),  the  ODE  was  normalised  by  the  coefficient  of  the  inertial  term,  making 
77i  the  ratio  of  buoyancy  to  inertial  effects  and  772  the  ratio  of  drag  to  inertial  effects 
for  given  V,  T.  Setting  the  V,  T  characteristic  scales  to  normalise  /Ti ,  /T3  =  1  gives 
a  formula  for  St  in  terms  of  given  quantities  (4.14)  which  embodies  this  ratio, 


St= 

2(p-pf)gH2 


67r/iRV0  drag  force 

j7rR3(p  —  p/)g  net  gravity  force  ’ 


(4.17) 


We  want  to  re-iterate  the  very  important  point  that  all  dimensionless  parameters  are 
ratios  of  competing  effects  in  models.  This  is  a  consequence  of  all  77’ s  being  formed 
by  dividing  equations  in  the  model  by  scaling  coefficients  for  the  influence  of  one 
term.  In  general, 

strength  of  effect  1 

77  =  - - - .  (4.18) 

strength  of  effect  2 


The  limiting  cases  of  such  ratios  have  clear  interpretations: 

•  77  ->  0:  effect  1  is  very  weak  (relative  to  effect  2). 

•  77  — >  00:  effect  1  is  very  strong  (relative  to  effect  2). 

Between  these  extremes  other  special  critical  values  77c  can  exist  that  separate 
different  qualitative  regimes  for  the  solution’s  behaviour.  The  model  (4.13)  has  a 
critical  Stokes  number  of  Stc  =  1 : 

•  For  St  <  1  solutions  describe  balls  starting  from  lower  speeds,  that  will  accelerate 
up  to  the  terminal  velocity. 

•  For  St  >  1  solutions  represent  balls  starting  from  speed  highers  than  the  terminal 
velocity  decelerating  for  all  times. 

•  For  St  =  1,  the  solution  starts  at  and  maintains  the  terminal  velocity. 

This  is  a  simple  example  of  a  critical  value  of  a  parameter  marking  a  bifurcation.  In 
other  systems,  critical  values  can  signify  a  change  in  the  stability  of  a  solution,  or  act 
as  boundaries  of  parameter  regimes  in  which  different  numbers  of  solutions  coexist. 

Careful  examination  of  related  issues  will  be  given  in  upcoming  chapters  (see 
Chaps.  6-10)  using  asymptotic  analysis  and  perturbation  methods  to  solve  models  in 
small  parameter  limits.  For  the  moment,  we  note  that  apart  from  any  physical  interpre¬ 
tations,  the  limiting  cases,  77  =  0  and  77  =  00,  have  a  clear  practical  difference — an 
infinite  coefficient  precludes  any  calculations  using  the  “usual  methods”.  In  (4.13), 
having  St  =  0  yields  a  well-defined  solution,2  v(t)  =  l+t,  while  (4.13)  with  St  =  00 
is  a  singular  limit  that  can  not  be  sensibly  evaluated  directly.  As  already  described 
in  connection  with  (4.8),  the  remedy  for  this  is  to  rescale  the  problem. 


■This  limiting  solution  could  also  be  obtained  using  L’Hopital’s  rule  for  the  St  ^  0  limit  of  (4.15). 
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We  note  that  the  limit  St  ->  0  in  (4.14)  corresponds  to  several  different  physical 
limiting  situations: 

•  Low  fluid  viscosity,  fi  — >  0, 

•  Low  initial  ball  velocity,  Vo  — >  0, 

•  High  ball  density,  p  — >  oo, 

•  Large  ball  size,  R  ->  oo. 

For  any  of  these  physical  regimes,  (4.13)  gives  a  well-defined  model  capable  of 
representing  the  limiting  behaviour.  Conversely,  for  the  opposite  extremes  (i.e.  high 
viscosity  or  high  speed  or  low  ball  mass),  (4.13)  would  have  a  divergent  coefficient 
and  should  not  be  used  in  current  form. 

For  example,  consider  the  limit  of  high  fluid  viscosity  that  would  be  problematic 
for  (4.13).  Setting  772  =  1  and  77 3  =  1  in  (4.11)  selects  the  characteristic  scales 


V  =  V0, 


and  the  new  form  of  the  scaled  model: 


dv 

dt 


=  n  1  -  v, 


V(0)  =  1, 


(4.19) 


(4.20) 


with  77i  =  1/St  and  yields  the  limiting  solution  v(t)  =  e  1  for  St  =  00. 


4.3.3  The  Burgers  Equation 


To  illustrate  nondimensionalization  for  a  partial  differential  equation,  we  consider  a 
problem  for  the  Burgers  equation , 


<9U  uau-na2u 

Tr  +  Uax  -Dax2  ’ 


0  <  X  <  E, 


with  the  boundary  and  initial  conditions, 


U(0,  T)  =  A,  <9XU(E,  T)  =  B,  U(X,  0)  =  C, 


(4.21a) 


(4.21b) 


where  A-E  are  given  constants  with  appropriate  dimensional  units.  In  Chap.  2,  the 
inviscid  Burgers  equation  (2.39)  was  introduced,  which  is  a  special  case  of  (4.21a) 
with  D  =  0.  Equation  (4.21a)  occurs  in  models  for  problems  in  fluid  dynamics 
(where  U  represents  a  velocity)  and  other  chemical  and  physical  systems,  where 
U  represents  other  physical  quantities.  Our  analysis  of  the  non-dimensionalization 
applies  irrespective  of  the  units  of  U. 
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We  must  choose  dimensional  scaling  constants  U,  L,  and  T  in 

U(X,  T)  =  XJu(x,  t),  X  =  Lx,  T  =  T t,  (4.22) 


to  yield  variables  u,x,t  that  are  dimensionless.  Using  (4.22)  in  the  form  U(X,  T)  = 
Uu(X/L,  T /T)  and  employing  the  chain  rule  leads  to  the  transformed  problem 


U\  du  /U2\  du  _  /DU\  d2u 

T  /  dt  \  L  /  dx  \  L2  /  dx 2  ’ 


0  <  Lx  <  E, 


Uw(0,  t)  =  A,  (U/L)d*M(E/L,  t)  =  B,  U u(x,  0)  =  C. 


Dividing  across  by  one  coefficient  in  each  expression,  we  arrive  at  the  nondimensional 
form 


du 


0  <  x  < 


(4.23) 


77i  772 


n3 


u( 0,  t) 


u  (x ,  0) 


n6 


In  (4.22),  we  have  three  scaling  constants  whose  values  we  can  choose  to  set  (an 
arbitrary  choice  of)  three  out  of  the  six  the  77’s  in  (4.23)  to  unity,  say 


n3  =  i 

n4  =  l 


77i  =  1 


>  L  =  E, 

*  U  =  A, 

*  T  =  L/U  =  E/A. 


(4.24) 


The  characteristic  length-,  solution-  and  time-scales  determined  in  (4.24)  can  be 
separated  into  two  types:  L,  U  are  imposed  scales,  while  the  timescale  T  is  a  derived 
scale. 

The  final  scaled  version  of  the  problem  is  then 


with 


du 

~dt 


T  u 


du 

dx 


0<jc  <  1, 


(4.25a) 


u( 0,  t)  =  1,  ux(  1,  t)  =  n5, 


u(x,  0)  =  776 , 


(4.25b) 


D 

AE’ 


BE 

X’ 


C 

A' 


n2  = 


n5  = 
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The  solution  of  this  problem  will  be  a  function  of  the  independent  variables  and  the 
three  remaining  77  parameters  and  gives  the  solution  of  the  original  problem  (4.21) 
via  (4.22)  as 


u  =  Au(x,  t,  n2,  775,  776),  X  =  Ex,  T  =  (E/A )t.  (4.26) 

This  gives  the  most  general  explicit  understanding  of  how  the  solution  depends  on 
the  constants  defining  the  original  problem.  In  contrast,  without  the  use  of  scaling 
analysis,  the  best  that  can  be  said  in  general  about  the  solution  of  the  original  problem 
(4.21)  is  that  it  will  have  some  dependence  on  each  parameter  and  variable, 

U  =  U(X,  T,  A,  B,  C,  D,  E). 


The  nondimensionalization  above  proceeded  by  setting  77i ,  773  and  774  equal  to 
unity.  Setting  77i  =  1  yielded  T  =  L/U,  which  is  known  as  the  convective  timescale 
(the  time  taken  to  travel  a  distance  L  at  velocity  U).  An  alternative  choice  for  a  derived 
time-scale  results  if  we  set  772  =  1  instead  of  77i  =  1 ;  the  characteristic  time  is  then 
T  =  L2/D  and  is  called  the  diffusive  timescale  (the  time  taken  for  diffusive  effects 
to  propagate  throughout  the  domain).  The  new  form  of  the  scaled  problem  becomes 


with 


du  ~  du  d2u 
dt  1  dx  dx2  ’ 


0  <  *  <  1, 


(4.27a) 


w(0, 0  =  1,  ux(l,t)  =  n5,  u(x,0)  =  n6,  (4.27b) 


AE 

~D"’ 


C 

A' 


It  is  notable  that  77 1  is  the  inverse  of  772  from  the  first  choice  of  scalings.  As  explained 
earlier,  a  dimensionless  parameter  always  represents  the  ratio  of  two  competing 
effects;  in  this  case  convection  and  diffusion.  The  relative  importance  of  these  effects 
is  therefore  measured  by  the  ratio 


convective  effects 

Pe  =  - 

diffusive  effects 


UL 

~cT’ 


(4.28) 


called  the  Peclet  number.  We  can  write  77i  =  Pe  or  772  =  1  /Pe  in  their  respective 
models  to  put  the  problem  in  forms  comparable  to  other  heat  and  mass  transfer 
problems  studied  in  engineering. 

Whether  (4.25),  (4.27)  or  some  other  choice  of  scaled  problem  is  appropriate  for 
the  questions  of  interest  is  resolved  by  the  second  scaling  principle  (4.8).  We  should 
choose  the  system  (4.27)  if  we  consider  the  limit  Pe  ^  0  (77i  — >►  0)  (convective 
effects  being  less  important  than  diffusive  effects)  and  the  system  (4.25)  for  the  limit 
Pe  — >  oo  (772  — >  0)  (convective  effects  being  more  important  than  diffusive  effects). 
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4.4  Further  Applications  of  Dimensional  Analysis 
4.4.1  Projectile  Motion  (Revisited) 

Dimensional  analysis  can  sometimes  be  used  to  identify  the  solution  structure  of 
problems  without  even  writing  down  model  equations.  Although  this  may  sound 
simpler  than  the  approach  we  have  outlined  above,  the  absence  of  equations  derived 
from  physical  laws  requires  us  torely  heavily  on  experience  and  intuition  for  guid¬ 
ance.  With  that  caveat  in  mind,  we  now  construct  the  form  of  the  solution  to  a 
projectile  problem  without  first  deriving  a  model  equation. 

Consider  a  projectile  of  mass  M  kg  launched  at  an  angle  a  up  from  the  ground 
with  a  given  speed  V  m/s.  At  some  later  time,  the  projectile  will  return  to  the  ground. 
Suppose  that  we  wish  to  calculate  the  time  of  flight  of  the  projectile  T  (s).  Besides 
the  given  parameters  describing  the  launch  of  the  object  (M,  V,  a ),  we  expect  the 
acceleration  due  to  gravity,  g  m/s2,  to  also  play  a  key  role. 

Consequently,  we  may  expect  that 

T  =  F(M,  V,  g,  a)  (4.29) 

for  some  dimensional  function  F.  With  respect  to  units  of  time,  the  only  way  to 
construct  a  dimensionless  parameter  in  terms  of  the  given  quantities  is  Tl\  =  gT/V. 
Then,  corresponding  to  (4.29),  the  dimensionless  time  must  given  by  a  dimensionless 
function  / 

Y  =  /(M,V,g,a).  (4.30) 

Of  the  parameters  in  /,  M,  V  and  g  are  independent  in  that  we  cannot  write  the 
dimensions  of  one  of  these  quantities  as  a  combination  of  the  dimensions  of  the  others 
(also  note  that  a  is  an  angle  and  so  already  a  dimensionless  quantity).  We  cannot 
therefore  construct  any  further  nondimensional  quantities  by  combining  them.  This 
is  not  consistent  with  the  requirement  that  /  be  a  nondimensional  function  and  so 
implies  that  /  must  be  independent  of  all  of  these  quantities,  yielding  simply  that 

T=^/(a).  (4.31) 

We  have  thus  simplified  a  dimensional  function  of  four  quantities  down  to  a  nondi¬ 
mensional  function  of  just  one,  and,  in  addition,  have  shown  that  T  should  be  inde¬ 
pendent  of  the  projectile  mass  M  without  reference  to  any  model  equation. 

Although  this  analysis  does  not  provide  the  form  of  the  function  /(a),  we  can 
immediately  see  that  T  is  linearly  dependent  on  the  initial  velocity  and  inversely 
dependent  on  g;  which  is  precisely  what  we  obtained  in  Sect.  4.3.1. 

In  the  above  discussion,  we  ignored  the  influence  of  air  resistance  on  the  dynamics 
of  the  projectile.  If  we  include  the  effects  of  air  resistance  so  that  the  projectile 
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experiences  a  retarding  force  linearly  proportional  to  the  velocity  with  proportionality 
constant  K  (measured  in  units  of  kg/s),  Eq.  (4.29)  would  now  take  the  form 

T  =  F(M,  V,  g,  a,  K).  (4.32) 


In  contrast  with  (4.31),  it  is  now  possible  to  combine  M,  V,  g  along  with  K  into  a 
dimensionless  product.  We  consequently  obtain 


gT 

v 


(4.33) 


namely  a  dimensionless  function  of  two  dimensionless  parameters  based  on  inde¬ 
pendent  combinations  of  the  given  quantities  describing  the  problem.  Note  that  we 
do  not  combine  the  dimensionless  parameter  a  with  the  other  dimensionless  prod¬ 
uct;  as  a  could  be  the  exponent  of  a  non-monomial  function,  say  sin  or  log,  and  this 
whole  function  would  remain  nondimensional  (and  undetermined  through  dimen¬ 
sional  analysis). 

The  introduction  of  the  air  resistance  constant  of  proportionality  K,  also  provides 
another  way  to  generate  a  quantity  with  dimensions  of  time  and  this  allows  us  to 
write 


KT 

Ivf 


(4.34) 


At  first  glance,  it  appears  that  we  have  two  functional  forms  for  the  same  expression, 
but  in  fact  (4.33)  and  (4.34)  are  equivalent,  where  we  have  (multiplying  through  by 
the  nondimensional  product) 


Mg  KT  gT  Mg  ,  /  Mg\  ~  /  Mg\ 

KV  M  V  V*’  KV/  1  V’  K V/ 

The  time  of  flight  T  can,  in  fact,  be  shown  to  be  given  implicitly  by 

M  /  Mg\  kt/m  MT 

This  equation  cannot  be  inverted  to  give  an  explicit  form  for  T.  But  (4.36)  can  be 
expressed  in  the  form  E(/7i,  772,  773 )  =0,  with  77i  =  KT /M,  which  subsequently 
leads  to  (4.34).  Thus  (4.34)  provides  us  with  an  understanding  of  the  qualitative 
dependence  of  T  on  all  quantities  in  the  system  and  a  more  detailed  quantitative 
solution  would  involve  the  numerical  solution  of  (4.36). 


(4.35) 


(4.36) 
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4.4.2  Closed  Curves  in  the  Plane 


For  our  final  example,  we  touch  on  how  dimensional  analysis  connects  with  the  more 
general  concept  of  similarity  that  we  will  describe  in  detail  later  in  Sect.  4.5.1. 

Simple  (non-intersecting)  closed  curves  can  be  associated  with  two  quantities: 
an  enclosed  area  A,  and  a  perimeter  P.  In  terms  of  dimensions,  [A]  =  L2  and  the 
perimeter  [P]  =  L,  from  which  we  can  form  one  dimensionless  parameter  TJ\  = 
P2/A  (the  dimensionless  perimeter  to  area  ratio).  For  a  circle  of  radius  R,  this  ratio 
reduces  to 


(4.37) 


Note  that  this  is  the  only  dimensionless  parameter  for  circles  and  it  is  independent 
of  the  radius  of  the  circle. 

For  a  square  with  sides  of  length  L,  the  perimeter  to  area  ratio  is 


(4.38) 


Consequently,  all  squares  are  also  similar  to  each  other.  Noting  that  16  >  Air,  we 
observe  that  for  a  given  perimeter  length,  circles  enclose  a  greater  area  than  that 
achieved  by  squares. 

Triangles  with  sides  of  length  L,  M,  N  have  perimeter  P  =  L  +  M  +  N  and 
Heron’s  formula  gives  the  area  as 


Consequently,  the  perimeter  to  area  ratio  can  be  written  as 


m  = 


i 


A  0 

L\ 

( 1  Mi 

( 1  N1 

/  2  V2 

P/ 

V2  p) 

V2  p) 

(4.39) 


In  this  expression,  we  have  several  dimensionless  ratios  of  lengths:  772  =  L/P, 

773  =  M/P,  and  II4.  =  N/P.  As  77i ,  772  are  independent  parameters  and  noting  that 

774  can  be  written  as  774  =  1  —  772  —  773  (using  the  equation  for  P),  we  can  write 

77i  =  1  - .  (4.40) 

V  2  (2  -  ^2)  (1  -  ^3)  (772  +  773  -i) 


3 In  Chap.  3,  we  showed  that  the  calculus  of  variations  can  be  used  to  prove  that,  among  all  smooth 
simple  closed  curves,  the  circle  uniquely  minimises  the  ratio  TJ\ . 
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4.5  The  Buckingham  Pi  Theorem 

We  conclude  this  chapter  by  reviewing  the  mathematical  theory  behind  dimensional 
analysis,  known  as  the  Buckingham  Pi  theorem.  While  dimensional  analysis  can  be 
applied  directly  without  this  theorem,  its  results  can  be  used  to  predict  the  number 
of  dimensionless  parameters  in  a  problem  and  further  reduce  calculations  involved. 

For  any  system  described  by  n  dimensional  quantities  Qi ,  Q2,  . . . ,  Q«  involving  r 
independent  base  dimensional  units,  [U 1  ] ,  [U2], . . . ,  [Ur],4 *  two  results  are  universal: 

(i)  A  set  of  dimensionless  parameters  77/,  y  =  1,2,...  (sometimes  also  called 
“dimensionless  groups”)  can  be  written  as  monomial  products  in  terms  of  powers 
of  the  Q/’s: 

/7  =  Q?Q^Q7...  (4.41) 

Since  each  77  must  be  dimensionless,  they  must  satisfy: 

[77]  =  [Chf  [Q2AQ3]7  •  •  •  =  [U1]°[U2]°  •  •  •  [Ur]°  (4.42) 

This  yields  a  set  of  r  homogeneous  linear  equations  for  the  exponents  a,  /?,  7,  . . . , 
one  for  each  of  the  [U/d  units,  k  =  1,  2,  . . . ,  r. 

Let  f  be  the  number  of  linearly  independent  equations  in  this  set,  with  r  <  r. 
Of  the  n  exponents  (a,  (3,  7,  . . .),  r  of  them  will  be  determined  from  the  dimen¬ 
sional  relations  (4.42)  in  terms  of  the  remaining  n  —  r  exponents,  which  are 
un-determined  from  this  analysis. 

(ii)  All  relations  between  the  original  quantities,  Qi,  Q2,  •  •  • ,  Q«  will  be  repre¬ 
sented  by  corresponding  relations  among  the  n  —  r  dimensionless  parameters, 
77i ,  772,  ■  ■  •  hi n — r  • 

F(77i,772,...,77„_r~)  =  0.  (4.43) 

The  structure  of  the  relations  is  dependent  upon  the  specific  form  of  each 
problem.  □ 

At  its  heart,  the  77  theorem  is  a  consequence  of  linear  algebra  applied  to  the 
dimensional  units  of  the  Q’s,  with  the  exponents  in  the  77’ s  being  analogous  to 
coefficients  of  a  set  of  linearly  independent  vectors  (whose  roles  are  played  by  the 
base  units  U^).  Different  presentations  and  more  detail  on  the  77 -theorem  can  be 
found  in  [9,  10,  49,  64]. 


4The  base  units,  Lk  in  the  ft  theorem  need  not  be  fundamental  units,  they  can  be  derived  units,  just 

as  long  they  form  a  linearly  independent  set. 
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4.5.1  Mathematical  Consequences 

While  the  Buckingham  77  theorem  and  the  other  observations  made  earlier  in  this 

chapter  are  straightforward,  they  lead  to  important  consequences: 

(i)  For  problems,  where  the  solution  is  a  function,  the  Pi  theorem  predicts  that  the 
non-dimensional  form  of  the  solution  will  depend  on  any  independent  variables 
and  the  77  groups, 

u  —  u  (x ,  t ,  77 1 , 77 ^ ,  . . . ,  lln y  ^ . 

To  write  the  solution  of  the  original  problem  in  terms  of  this  dimensionless 
solution,  characteristic  scales  are  needed  for  all  of  the  dependent  and  indepen¬ 
dent  variables.  Analogous  to  (4.42),  choices  for  these  scales  in  terms  of  the 
given  quantities  of  the  problem  are  obtained  by  solving  for  the  exponents  in 

[Qi]“[Q2]/3[Q3]7...  =  [U;]1,  (4.44) 

namely,  the  options  for  characteristic  scales  are  determined  by  solving  the  linear 
system  with  the  entry  for  one  dimension  set  to  unity  while  the  other  righthand 
side  entries  are  still  zero,  as  in  (4.42). 

(ii)  If  77i  is  a  dimensionless  parameter  for  a  system,  then  so  is  fl\  =  aTJ^  for 

/V  w 

any  numbers  a ,  / 3 ,  and  even  more  generally,  so  is  77i  =  /z(77i)  and  77i  = 
g(77i,  772)  for  any  functions  g  and  h.  This  illustrates  the  very  large  degree  of 
non-uniqueness  in  specifying  dimensionless  parameters.  However,  this  being 
said,  once  choices  are  made  for  the  77’s,  they  allow  for  systematic  analysis  of 
comparable  systems. 

(iii)  Two  systems  are  called  similar  if  they  reduce  to  the  same  dimensionless  prob¬ 
lem  with  the  same  values  for  all  of  the  TJ  ’s  in  (4.43).  Consequently,  two  similar 
systems  will  have  the  same  dimensionless  properties  and  their  dimensional 
properties  will  be  proportionally  scaled.  For  example,  in  the  context  of  trian¬ 
gles  (from  Sect.  4.4.2),  two  triangles  are  similar  if  they  share  the  same  values 
of  77i,  772,  773;  this  is  consistent  with  the  description  for  similar  triangles  in 
plane  geometry. 

This  principle,  sometimes  called  the  similitude  condition  (or  “dynamic  simil¬ 
itude”)  is  the  basis  for  constructing  smaller-scale  inexpensive  prototypes  for 
testing  and  predicting  the  properties  of  full-sized  systems. 

(iv)  Under  broad  conditions,  when  we  can  assume  the  implicit  function  theorem 
applies  to  (4.43),  then  the  values  of  some  dimensionless  parameters  (or  sys¬ 
tem  properties)  can  be  expressed  as  functions  of  the  values  of  the  others.  For 
example,  say  F(77i,  772,  773)  =0,  then  we  may  able  to  obtain 


773  =  /(771,772). 


4.5  The  Buckingham  Pi  Theorem 


103 


In  particular,  note  that  (4.40)  is  an  example  of  a  specific  relation  of  the  form 
77i  =  /(772,773). 

The  77 -theorem  will  not  determine  the  solution  function  /,  but  it  does  help 
greatly  by  telling  us  how  many  independent  parameters  it  depends  on. 

As  described  earlier,  there  are  many  different  possible  sets  of  77’s  that  can  be 
employed  to  describe  each  problem.  Useful  choices  depend  on  which  system  prop¬ 
erties  are  known  versus  which  ones  are  being  sought. 


4.5.2  Application  to  the  Quadratic  Equation 

We  now  apply  the  77  theorem  in  detail  to  an  elementary  problem  to  illustrate  a  broader 
idea — while  the  theorem  follows  from  dimensional  analysis,  it  can  be  applied  to  any 
mathematical  problem  to  determine  the  essential  dependence  of  solutions  on  the 
given  parameters  of  the  system. 

One  context  in  which  the  quadratic  equation, 

AT2  +  BT  +  C  =  0,  (4.45) 

arises  is  (as  mentioned  previously)  in  the  kinematic  study  of  an  object  moving  at 
a  constant  acceleration,  where  T  would  be  the  time  of  flight  (an  unknown,  to  be 
solved  for)  and  A,  B,  C  are  given  dimensional  constants  related  to  the  acceleration, 
initial  velocity  and  initial  position  respectively.  In  this  context,  the  dimensions  of  the 
quantities  in  (4.45)  are 

[T]  =  T,  [C]  =  L,  [B]  =  L/T,  [A]  =  L/T2. 

Following  (4.41),  we  can  denote  dimensionless  parameters  in  terms  of  these  quanti¬ 
ties  as 

n  =  AftBJC'T'5.  (4.46) 

for  appropriate  values  of  the  exponents.  Noting  that  the  only  dimensions  that  appear 
in  this  problem  are  L  and  T,  for  77  to  be  dimensionless  requires  that  [77]  =  L°T°, 
and  so 


(L/T2)“(L/T)/T7T<5  =  L°T° 


j^Q;+/3+^)/'y — 2cy — /3~\~5  j  O'yO 


The  four  unknown  exponents  a,  (3,  7  and  S  must  satisfy  the  two  dimension- 
independence  equations 

Li  a  T  f3  T  7  =  0  5 
T  :  -2a-  0  +  6  =  0. 
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Standard  theory  from  linear  algebra  then  gives  that  each  solution  will  have  two  free 
parameters  that  can  be  set  arbitrarily,  with  two  variables  being  determined  in  terms 
of  the  other  two  variables.  This  exactly  corresponds  to  the  77  theorem’s  count:  4 
quantities  —2  independent  dimensions  =  2  parameters. 

There  are  many  possible  choices  for  selecting  the  77 ’s.  However,  as  explained  in 
Sect.  4.3.1,  there  are  conventions  that  lead  to  specific  choices  that  make  the  mathe¬ 
matical  formulation  more  easily  interpretable  than  the  original  problem. 

For  the  current  example,  we  seek  a  dimensionless  variable  corresponding  to  T:  in 
(4.46),  if  we  pick  S  =  1  and  for  convenience  take  7  =  0,  then  we  have  a  =  1 ,  f3  = 
—  1,  giving,  lli  =  A1  B~1T1 .  Since  [77i]  =  1  we  see  that  [T]  =  [B/A],  namely  that 
the  ratio  of  the  given  quantities  B/A  determines  a  characteristic  scale. 

Excluding  the  variable  T,  the  example  has  3  given  quantities  (A,  B,  C) — 2  dimen¬ 
sions  (L,  T)  =  1  independent  system  parameter.  Setting  S  =  0  in  (4.46)  and  for 
convenience  picking  7=1  yields  a  =  1,  (3  =  —  2  and  772  =  AC/B2. 

Employing  the  second  result  from  the  Buckingham  theorem,  the  dimensionless 
groups  should  all  be  related,  E(77i ,  772)  =  0.  Applying  the  implicit  function  theorem 
to  F,  we  can  write 


and  hence 


n  1  =  f(n2) 


AT 

"b“ 


=  f(n2). 


(4.47) 


We  now  compare  this  against  the  standard  result  for  the  solution  of  the  quadratic 
equation  (4.45): 

T  -B  ±  VB2  -  4AC  — B  ±  a/B2(1  —  4AC/B2) 

T  = - — - = - - - 7 - - — -  (4.48) 

2A  2A 

Even  though  the  Buckingham  theorem  cannot  predict  the  details  of  the  function 
/(772),  (4.47)  has  indicated  the  way  in  which  the  solution  of  (4.45)  depends  on  the 
given  quantities. 


4.6  Further  Directions 

Further  use  of  dimensional  analysis  for  mathematical  modelling  is  well  illustrated 
in  Chap.  1  of  Holmes  [49]  and  at  a  more  advanced  level  in  the  books  of  Barenblatt 
[9,  10]  and  Sedov  [88].  In  addition,  the  book  by  Szirtes  [95]  contains  an  extensive 
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number  of  worked  examples  in  dimensional  analysis  taken  from  a  wide  range  of 
areas. 

Models  for  projectile  motion  have  been  used  as  a  classic  example  of  dimensional 
analysis  in  many  books  on  modelling  [49,  64]  since  Lin  and  Segel  [63,  89].  This 
stems  from  its  broad  appeal  as  a  fundamental  problem  in  physics  as  well  as  the  large 
number  of  basic  modelling  concepts  it  connects  to.  Likewise,  the  Burgers  equation 
is  a  partial  differential  equation  that  plays  an  important  role  in  many  applications  in 
applied  mathematics  beyond  its  original  setting  in  fluid  dynamics  [106],  and  will  be 
re-visited  in  several  later  chapters. 

Finally,  in  our  introductory  discussion,  we  have  barely  touched  on  the  relation  of 
dimensional  analysis  to  similitude  and  scale  models — how  to  construct  a  model  ship, 
for  example,  with  properties  that  when  scaled  up  will  be  consistent  with  those  of  the 
actual  ship.  The  classic  text  by  Pankhurst  [83]  has  a  clear  and  concise  introduction 
to  this  topic  written  from  an  engineering  perspective. 


4.7  Exercises 


4.1  ( Choosing  scalings  for  different  limits)  Consider  the  dimensional  problem  for 
the  motion  of  a  projectile  launched  from  close  to  the  surface  of  the  Earth: 


d2 Y  _  GMe 

dJ2  ~  ~  (Re  +  Y)2  ’ 


Y(0)  =  2  m, 


Y^O)  =  —Vq  m/s. 


Assume  the  Earth  to  be  spherical  with  a  uniform  density,  Me  =  |7rR|pE- 

Let  Y(T)  =  L y(t)  and  T  =  T t.  Consider  the  following  cases: 

(i)  The  fast  projectile  limit:  Re  =  fixed,  Vo  — >  oo,  pE  =  fixed. 

(ii)  The  dense  Earth  limit:  Re  =  fixed,  Vo  =  fixed,  ^  oc. 

(iii)  The  light  Earth  limit:  Re  =  fixed,  Vo  =  fixed,  pe  — >  0. 

(iv)  The  small  Earth  limit:  Re  — >  0,  Vo  =  fixed,  Me  =  fixed. 

For  each  case: 

(a)  Choose  characteristic  scalings  L,  T  to  normalise  as  many  terms  as  possible. 

(b)  Choose  the  scalings  so  that  the  time  it  takes  for  the  projectile  to  fall  to  should 
be  finite  for  the  given  limit.  Namely,  the  speed,  acceleration,  and  initial  height 
should  not  diverge,  nor  should  falling  (say  down  to  1  m)  take  infinitely  long  in 
time  (as  in  for  example  if  the  velocity  and  acceleration  both  approach  zero)  or 
happen  nearly  instantaneously  (if  the  initial  height  approaches  zero). 

(c)  Write  the  scaled  problem  and  identify  all  remaining  dimensionless  parameters. 

(d)  Identify  the  limiting  small  parameter  and  for  each  case,  write  the  problem  (called 
the  leading  order  problem)  when  the  parameter  is  set  to  zero. 
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Fig.  4.1  The  elementary  problem  of  a  mass  on  spring  with  an  applied  force:  (Left)  dimensional 
parameters,  (Right)  solutions  X(T)  for  various  parameters  and  (Inset)  the  amplitude  in  relation  to 
the  forcing  frequency 


4.2  The  governing  equation  for  the  linear  mass-spring  system  shown  in  Fig.  4.1  is 


M 


d2X 
dJ 2 


+  KX  =  F  sin(1f2T), 


X(0)  =  Xo, 


dX 

dJ 


T=0 


where  X(T)  [m]  is  the  position  of  the  mass  as  a  function  of  time  with  mass  M  [kg],  and 
spring  coefficient  K  [N/m],  magnitude  of  the  applied  force,  F  [N],  forcing  frequency, 
Q  [s_1],  as  well  as  general  initial  conditions, 

Nondimensionalize  and  select  length-  and  time-scales  L,  T  to  normalise  the  coef¬ 
ficients  of  the  terms  on  the  left  side  of  the  ODE  and  the  initial  condition  for  position. 
Write  and  solve  the  scaled  problem,  and  identify  the  dimensionless  parameter  for  the 
ratio  of  the  forcing  frequency  to  resonant  frequency,  where  the  solution’s  amplitude 
grows  without  bound,  as  shown  for  in  Fig.  4.1. 


4.3  Consider  the  initial  value  problem  for  a  damped  driven  nonlinear  oscillator: 


M 


d2X  dX  ,  .  T  A  dX 

+  B-+KX=  =  FS,„«2D,  X(0)=A,  - 


=  c. 


T=0 


(a)  If  the  units  are  [M]  =  kg,  [T]  =  s  and  [X]  =  m,  state  the  units  for  A,  B,  K,  F,  Q . 

(b)  Consider  the  limit  M  — >  0.  Let  X(T)  =  L x(t)  and  T  =  Tt  where  L,T  are 
characteristic  scale  constants.  Determine  choices  for  L,T  that  yield  an  ODE  for 
v  with  all  three  terms  on  the  left  side  of  the  equation  normalised. 

(c)  Identify  the  dimensionless  parameters  in  this  scaled  problem. 


4.4  Consider  the  dimensional  equation  for  the  damped  pendulum 


d20 

7f 2 


+  ^  sin  0  =  0, 


0(0)  =  0O, 


©'(0)  =  S2q, 
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where  B  is  the  damping  coefficient,  g  gravity,  L  length,  and  ®o,  initial  angle 
and  speed.  Nondimensionalize  in  terms  of  general  scalings  for  the  position  and  time: 
&{T)  =  @6(t),T  =  T  t.  Determine  the  form  of  the  scaled  model  and  the  appropriate 
characteristic  scales  for: 

(i)  The  weakly  damped  limit:  B  — >  0. 

(ii)  The  slow  speed  limit:  £2$  — >  0. 

(iii)  The  small  amplitude  oscillation  limit:  ®o  — >  0. 

4.5  Consider  dimensional  equations  describing  a  system  of  chemical  reactions  for 
the  concentrations  of  three  chemicals: 

dX  A 

=  A  -  BY  X(0)  =  X0 

dJ 

dl  =  CX-DZ  Y(0)  =  Y0  (4.49) 

d\ 

=  EY  -  FY2  +  GY3  -  HZ  Z(0)  =  Z0 

d\ 

where  A,  B,  C, . . .  H  are  given  dimensional  constants.  By  scaling  X,  Y,  Z,  T  (i.e. 
X(T)  =  Xx(t)  and  similarly  for  the  others)  show  that  these  equations  can  be  non- 
dimensionalized  in  the  form: 

dx 

-=a-y  x(0)  =  fi 

dt 

P-^=x-z  y(0)  =  (J  (4.50) 

dt 

dz  o  1  o 

7—  =  y-y2  +  - Sy 3  -  z  z(0)  =  u 
dt  3 

(a)  Determine  the  characteristic  scalings  X,  Y,  Z,  T  and  the  dimensionless  parame¬ 
ters  in  terms  of  A-H  and  Xo,  Yo,  Zo. 

(b)  If  7  =  0  show  that  the  problem  reduces  to  a  system  of  two  ODEs  in  terms  of 
x(t)  and  y(t)  only.  Find  the  relation  that  the  initial  conditions  must  satisfy  in 
order  to  avoid  a  contradiction. 

(c)  If  (5  =  0  show  that  the  problem  reduces  to  single  first-order  ODE  for  y(t) 
alone.  Find  the  relation  that  the  initial  conditions  must  satisfy  in  order  to  avoid 
a  contradiction. 

4.6  Consider  the  dimensional  problem  for  U(X,  T): 


<9U 

<9T 


cx2t3 


X  >  0  T  >  0 


U(0,  T)  =  0,  U3(0,T)  +  DUXxx(0,T)  =  E, 


where  A,  B,  C,  D,  E  are  given  constants. 
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(a)  If  X  is  measured  in  meters,  T  is  measured  in  seconds,  and  U  is  a  density  function, 
measured  in  kg/m3,  determine  the  units  for  A-E. 

(b)  Determine  L,  T,  U  in  terms  of  A-E  in  the  dimensional  scaling  U  =  U u(x,  t) 
with  X  =  Lx,  T  =  Tt  so  that  all  of  coefficients  in  the  PDE  are  normalised. 
Write  the  complete  nondimensionalized  problem  and  identify  the  dimensionless 
parameters. 

4.7  The  shallow  water  equations  are  a  system  of  two  PDEs  describing  fluid  flow  in 
shallow  (long,  slender)  layers,  like  rivers.  In  dimensional  form,  they  are 

djH  +  H<9XU  +  U<9XH  =  0, 

&rU  +  U<9XU  +  g<9xH  =  0, 

where  U(X,  T)  is  the  fluid  speed,  H(X,  T)  is  the  height  of  the  fluid  layer,  and  g  is  the 
acceleration  due  to  gravity. 

Nondimensionalize  using  the  average  depth  of  a  river  H,  and  average  speed  U  and 
lengthscale  L 

H  =  Hh,  U  =  U  w,  X  =  Lx,  T  =  Tt. 


Select  the  timescale  T  to  normalise  all  terms  in  the  h  equation  and  determine  the 
only  remaining  dimensionless  parameter,  called  the  Froude  number  (Fr). 


4.8  One  model  for  the  motion  of  a  projectile  of  mass  M  launched  from  the  origin 
at  an  angle  a  to  the  horizontal  with  initial  velocity  V  is  given  by  the  solution  to  the 
vector  equation, 


M 


d2X 

7f 2 


-Mg  -  K 


dX 

df’ 


where  X  =  (X,  Y),  g  =  (0,  g)  and  K  is  a  coefficient  of  air  resistance. 

(a)  Use  dimensional  homogeneity  to  determine  the  units  of  K. 

(b)  In  the  absence  of  air  resistance  (K  =  0),  show  that  the  solution  to  this  equation 
is  consistent  with  (4.31). 

(c)  Show  that  Eq.  (4.36)  for  the  time  of  flight  T  can  be  obtained  from  this  equation 
when  the  air  resistance  is  directional  proportional  to  the  projectile’s  velocity 
(K  7^  0). 

(d)  Repeat  the  dimensional  analysis  from  the  example  in  the  text,  assuming  that  the 
air-resistance  is  directly  proportional  to  the  square  of  the  velocity,  (note:  you 
will  have  to  redefine  the  dimensions  of  K). 

(e)  Perform  a  similar  analysis  in  both  cases  to  derive  a  nondimensional  expression 
for  the  distance  travelled. 

4.9  Returning  to  the  relation  between  area  and  perimeter  from  Sect.  4.4.2,  consider 

these  questions  for  different  families  of  shapes: 

(a)  Show  that  for  all  rectangles  (with  length  L  and  width  W),  the  perimeter-to-area 
ratio  can  be  expressed  in  terms  of  a  length-to-width  aspect  ratio  parameter. 
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(b)  For  ellipses  having  semi-major  and  semi-minor  axes  of  L,  W,  (X/L)2  + 
(Y /W)2  =  1,  the  area  is  given  by  A  =  7tLW  and  a  formula  by  Ramanujan  [85] 
gives  the  approximate  perimeter  as  P  ^  7r(3(L  +  W)  —  V (3L  +  W)(L  +  3W) ). 
Relate  the  dimensionless  perimeter  to  area  ratio  to  the  aspect  ratio.  What  can  be 
said  about  the  relations  between  a  rectangle  and  its  inscribed  ellipse? 

(c)  Use  (4.40)  to  show  that  all  triangles  have  77i  >  \2\[3. 

4.10  To  show  that  if  different  choices  are  made  for  77i,  772  in  (4.46),  the  results 

still  yield  the  form  of  the  quadratic  formula  (4.48).  Determine  77i,  772,  /(772)  if 

4  =  7=  1  in  77]  and  4  =  0,  7  =  1  in  772. 

4.11  Consider  the  cubic  equation, 

AT3  +  BT2  +  CT  +  D  =  0. 

(a)  Assuming  the  dimensions  are  [T]  =  T  and  [D]  =  L,  use  the  Buckingham  77 
theorem  to  determine  the  dimensionless  groups  for  this  equation. 

(b)  Using  (a)  write  a  formula  for  how  the  roots  T  depend  on  combinations  of  the 
coefficients. 

(c)  If  Xk  are  the  three  roots  of  the  equation  2X3  +  3X2  +  4X  +  5  =  0,  then  use  (b) 
to  determine  new  coefficients  in  the  following  equation 

Y3  +  5Y2  +  CY+D  =  0 


in  terms  of  the  previous  A-D  such  that  the  roots  are  double  the  roots  of  the  first 
equation,  =  2X^.  (Do  not  determine  the  values  of  the  roots  directly.) 

4.12  The  fluid  dynamics  problem  of  how  water  drips  from  a  faucet  depends  on 
several  quantities:  the  acceleration  due  to  gravity  g[m/s2],  the  density  of  water 
p[kg/m3],  the  viscosity  of  water  /i[kg/(ms)],  the  surface  tension  of  water  cr[kg/s2], 
the  radius  of  the  faucet  nozzle  R[m],  and  the  speed  of  the  water  coming  out  of  the 
faucet  U[m/s]. 

(a)  Use  the  Buckingham  77  theorem  to  determine  the  number  of  dimensionless 
parameters  and  write 

n  =  gApBncaDREUF 


to  determine  the  equations  relating  the  exponents  A-F . 

(b)  The  choice  of  dimensionless  parameters  to  describe  problems  is  not  unique.  This 
problem  can  be  described  in  terms  of  the  set  of  77  parameters: 


Re  = 


pUR 

p 


Ca  = 


pU 

a 


(called  the  Reynolds,  Bond,  and  Capillary  numbers)  or  in  terms  of  the  set 
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We  = 


Oh  = 


y/pR(T 


Ga  = 


3 


(the  Weber,  Ohnesorge,  and  Galileo  numbers). 

(i)  State  the  choices  for  A-F  for  each  parameter  and  show  that  each  of  these 
parameters  satisfies  the  equations  from  (a). 

(ii)  Show  that  the  3  parameters  in  each  set  are  independent  by  showing  that 
their  vectors  of  dimensional  exponents  v  =  (A,  B,  C,  D,  E,  F)  are  linearly 
independent. 


Part  II 

Solution  Techniques 


Chapter  5 

Self-Similar  Scaling  Solutions  of  Differential 
Equations 


Constructing  solutions  of  ordinary  and  partial  differential  equations  can  be  a  chal¬ 
lenging  process  (even  for  the  linear  case).  For  linear  problems,  techniques  such  as 
separation  of  variables,  and  Fourier  and  Laplace  transforms  can  often  be  used  to 
write  the  general  solution  of  a  differential  equation  that  can  then  be  specialised  to 
a  particular  problem  through  the  imposition  of  additional  (initial  and/or  boundary) 
conditions  to  generate  a  unique  solution.  Such  solutions  can  be  cumbersome  to  obtain 
and  may  still  require  substantial  further  analysis  in  order  to  gain  a  clear  insight  into 
the  nature  of  the  system’s  resulting  behaviour.  For  nonlinear  problems,  techniques  to 
obtain  general  solutions  exist  for  only  a  very  limited  number  of  types  of  equations. 

In  this  chapter,  we  will  show  that  the  scaling  analysis  introduced  in  the  context 
of  dimensional  analysis  in  Chap.  4  can  be  applied  to  obtain  special  solutions  of 
both  linear  and  nonlinear  partial  differential  equations.  These  solutions  are  called 
similarity  solutions  (sometimes  also  called  self- similar  solutions). 

It  will  be  shown  that  if  the  PDE  can  be  scaled  in  such  a  way  as  to  exactly  reproduce 
its  original  form  (as  suggested  by  the  term  ‘self-similar’),  then  it  will  possess  a 
class  of  solutions  that  share  this  property.  A  consequence  of  the  property  of  self¬ 
similarity  is  that  these  solutions  can  be  obtained  from  a  reduced  version  of  the 
original  problem.  For  a  PDE  with  two  independent  variables,  this  leads  to  an  ODE 
problem  for  the  similarity  solution.  Despite  their  nature  as  exact  solutions  to  a  given 
problem  under  special  conditions,  they  often  provide  important  understanding  of  the 
broader  behaviour  of  all  solutions  of  the  system. 

The  process  of  constructing  a  similarity  solution  to  a  given  problem  consists  of 
three  stages: 

(i)  Looking  for  a  scaling  “symmetry”  of  the  problem  to  see  if  similarity  solutions 
are  possible. 

(ii)  Determining  the  forms  of  the  similarity  variables  and  functions  from  the  scale- 
invariant  77  groups  for  the  problem. 
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(iii)  Solving  a  reduced  problem  for  the  similarity  function  and  then  transforming 
back  to  give  the  solution  of  original  problem. 

We  will  illustrate  the  stages  in  this  process  in  detail  and  then  go  on  to  apply  the 
method  to  further  examples. 


5.1  Finding  Scaling-Invariant  Symmetries 


Scaling  analysis  can  identify  whether  a  problem,  say  for  u  =  u(x,t ),  will  admit  a 
similarity  solution  by  making  use  of  the  change  of  variables 

u(x,  t)  =  \Ju(x,  t ),  x  =  Lx,  t  =  Tt,  (5.1) 

where  U,  L,  T  are  un-determined  real  positive  parameters.  We  call  the  problem  scale 
invariant  if  relationships  exist  between  the  scaling  parameters  U,  L,  T  in  (5.1)  that 
make  the  scaled  problem  take  exactly  the  same  form  as  the  original  problem  with  at 
least  one  scaling  parameter  remaining  undetermined, 

Problem  for  u  (x ,  t )  =  Problem  for  u  (x ,  t ) .  (5.2) 


For  evolution  equations,  it  expected  that  the  timescale  T  is  the  free  parameter  and 
the  other  scales  can  be  expressed  as  U  =  U(T)  and  L  =  L(T). 

As  an  example,  consider  the  inviscid  Burgers  equation  for  u(x,  t)  that  was  intro¬ 
duced  in  Chap.  2, 

du  du 

- h  u —  =0,  0  <  v  <  oo,  (5.3a) 

dt  dx 

subject  to  two  side  conditions, 


u( 0,  t)  =  0, 


*oo 


u(x ,  t )  dx  =  4. 


(5.3b) 


Substituting  the  change  of  variables  (5.1)  into  (5.3a,  5.3b)  yields, 


du  /UT\  „  du 

—  +  I  — —  I  u —  =0,  0  <  x  <  oo, 

dt  V  L  )  dx 


(5.4a) 


u{ 0,  t )  =  0, 


u(x,  t )  dx  =  4. 


oo 


(5.4b) 
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Setting  UT/L  =  1  and  UL  =  1  eliminates  the  scaling  constants  from  (5.4a,  5.4b) 
and  makes  the  scaled  system  identical  with  (5.3a,  5.3b)  and  hence  the  problem  is 
scale  invariant.  From  these  two  relationships,  we  can  express  U  and  L  in  terms  of  the 
free  parameter  T, 

L  =  T1/2,  U  =  T~l/2.  (5.5) 

Consequently,  if  we  have  one  solution  of  (5.3a,  5.3b),  say  u  =  u(x,  t ),  then  other 
solutions  are  given  by  the  one-parameter  continuous  family, 

u(x,  t)  =  T_1/2w(T_1/2v,  T ~lt),  (5.6) 

for  arbitrary  values  of  T.  This  transformation  is  called  a  scaling  symmetry  of  (5.3) 
[22,  52]. 

There  is  a  strong  resemblance  between  the  scaling  forms  (5.1)  and  (4.22),  but  there 
is  also  an  important  underlying  difference.  The  problems  in  this  chapter  will  be  taken 
to  already  be  in  dimensionless  form  (having  been  previously  nondimensionalized 
using  the  approach  described  in  Chap.  4).  The  scaling  parameters  in  (5. 1)  are  therefore 
dimensionless,  corresponding  to  the  magnitudes  of  solution-,  length-,  time-,  or  other 
relevant  scales. 


5.2  Determining  the  Form  of  the  Similarity  Solution 

Setting  T  =  1  in  (5.6)  returns  the  original  solution  and  provides  no  new  insight.  But, 
noting  that  T  is  an  arbitrary  parameter  and  not  part  of  the  original  specification  of  the 
problem  suggests  that  the  solution  should  be  independent  of  T,  or  more  precisely,  the 
solution  should  be  invariant  with  respect  to  changes  in  T.  We  can  use  this  principle 
to  yield  additional  information  on  the  form  of  the  solution. 

In  a  manner  analogous  to  that  used  for  dimensionless  parameters  in  Chap.  4,  we 
define  a  scale-invariant  parameter ,  77,  as  a  monomial  product  of  powers  of  the 
variables  of  the  system 

77  =  uaxbtc,  (5.7) 

which,  upon  substitution  of  (5.1),  is  independent  of  the  undetermined  scaling  para¬ 
meter  (T).  Analogous  to  the  notation  we  have  previously  employed  for  dimensions 
of  a  quantity,  let  [77]  represent  the  dependence  of  77  on  the  scalings  U,  L,  T  so  that 

[77]  =  UaLbTc  =  T°,  (5.8) 

and  in  particular  for  (5.5),  this  yields 


[77]  =  J-«/2Ti>/2Tc  =  T0^ 
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We  then  define  a  scale  invariant  independent  variable  (or  “similarity  variable ”)  and 
the  corresponding  scale  invariant  solution  (or  “similarity  function')  as  follows. 

The  similarity  variable  is  a  scale-invariant  product  77  that  does  not  depend  on  the 
solution  u ,  namely  a  =  0  in  (5.7).  Generally  it  is  most  convenient  to  choose  a  linear 
dependence  on  the  independent  spatial  variable,  b  =  1,  which  leads  to 

/7i  =  xtc  =*  [77i  ]  =  LTC  =  T°.  (5.9) 

For  our  example,  from  (5.5),  [/7i  ]  =  Tl/  2+r  and  so  we  obtain  c  =  — 1/2.  A  common 
notation  for  the  similarity  variable  is  the  Greek  letter  77,  hence  we  relabel  77i  as  77 
yielding 

77  =  xt~1^2. 

The  similarity  function  is  a  scale-invariant  77  that  is  (most  simply)  linearly  related 
to  the  solution  of  the  model,  namely  a  —  1  in  (5.7).  We  must  make  choices  for  the 
exponents  b,  c  in  order  to  represent  the  solution  in  the  most  convenient  form;  such 
choices  are  typically  guided  by  the  side-conditions  present  in  the  problem. 

For  convenience  in  (5.3),  we  select  b  —  0  yielding 

n2  =  Utc  ==>  [772]  =  UTC  =  T°.  (5.10) 

This  requires  that  [/72]  =  T _1/2+c  an(j  hence  c  =  1/2;  we  relabel  772  as  /  (the 
similarity  function)  which  satisfies 


f  =  tl^2u. 

The  main  result  then  follows  from  the  analogous  result  in  the  second  part  of  the 
Buckingham  Pi  theorem  (see  Sect.  4.5.1) — that  all  dimensionless  parameters  must 
be  related  to  each  other.  Similarly,  this  implies  that  all  scale  invariant  parameters 
must  be  related,  which  we  state  in  the  form 


F(77!,772)  =  0  =*  F (77,  /)  =  0. 

Applying  the  implicit  function  theorem  then  gives 

/  =  fin)  =*>  tl/2u  =  f(xt~l/2). 
and  consequently  the  final  form  of  the  similarity  solution  of  the  PDE, 

u(x,  t)  =  t~1^2  f(xt~1/2).  (5.11) 


1  The  choice  b  =  1  leads  to  a  differential  equation  for  the  similarity  function  whose  dependence  on 

^-derivatives  will  mirror  the  x -derivatives  in  the  original  problem. 


5.2  Determining  the  Form  of  the  Similarity  Solution 
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We  note  that  using  this  representation  of  the  solution  reduces  the  boundary  condition 
at  x  =  0  for  t  >  0  to  the  condition  /(0)  =  0.  If  we  had  instead  selected  c  =  0  in 
(5.7),  that  choice  would  yield  772  =  ux  and  u(x ,  t)  =  x~{ g(xt~{^2).  The  boundary 
condition  at  v  =  0  would  then  be  in-determinant  (and  hence  more  difficult  to  work 
with). 


5.3  Solving  for  the  Similarity  Function 

Once  the  form  of  the  similarity  solution  of  the  PDE  has  been  determined,  substituting 
that  form  into  the  original  problem  will  yield  a  reduced  problem  for  the  similarity 
function  f(r]).  Substituting  (5.11)  into  (5.3a,  5.3b)  gives 

z  z  ar\  dr] 

roo 

r1/2/(0)  =  0,  /  f(r1)dr1  =  4. 

JO 

Noting  that  the  product  xt~2  can  be  rewritten  as  xt ~2  =  r\t ~3/2,  we  are  able  to  factor 
out  all  occurrences  of  the  variable  t  to  leave  a  reduced  problem  for  the  similarity 
function, 


df  df 

f  +  V-J-  -2/-f  =0,  / (0)  =  0, 

dij  dr] 

In  this  case,  we  have  reduced  a  PDE  to  an  ODE.  The  separation  between  the  new 
linearly  independent  variables  t  and  77  is  a  universal  feature  of  similarity  solutions 
stemming  from  the  scale-invariance  of  the  problem.  If  (5.12)  could  not  be  written 
in  a  form  independent  of  t,  then  an  error  must  have  been  introduced  in  one  of  the 
previous  steps;  this  is  a  useful  consistency  check  on  the  process. 

The  ODE  for  f(jj)  in  (5.12)  can  then  be  put  in  form  (rjf)'  =  (f2)'  and  directly 
integrated  to  give 

>7  f-f2  =  C. 

Evaluating  this  equation  at  ij  =  0  and  applying  the  condition  /(0)  =  0  then  fixes 
the  value  of  the  constant  of  integration  C  =  0  and  we  are  left  with  two  possible 
solutions, 


•00 


f  dr 7=4. 


(5.12) 


fin)  =  0,  or  ftm)  =  17. 


(5.13) 
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Inserting  the  nontrivial  solution,  f(rf)  =  ij,  into  (5.11)  we  obtain  a  self-similar 
solution  of  the  PDE  in  the  form 


u(x ,  t)  =  x/t,  (5.14) 

which  is  known  as  a  rarefaction  fan  (see  Sect.  2.6,  Eq.  (2.43)  for  a  shifted  version 
of  this  solution  obtained  via  the  method  of  characteristics).  Finally,  we  note  that 
satisfying  the  integral  condition  in  (5.12)  necessitates  the  use  of  both  solutions  from 
(5.13)  in  constructing  the  solution  of  (5.3)  (compare  with  Sect.  2.6). 


5.4  Further  Comments  on  Self-Similar  Solutions 

It  is  instructive  to  contrast  the  above  analysis  of  problem  (5.3a,  5.3b)  with  the  scaling 
and  non-dimensionalization  performed  on  problem  (4.21a,  4.21b)  in  Chap.  4.  The 
PDE  (4.21a)  is  very  closely  related  to  (5.3a),  and  we  will  show  that  this  equation 
does  indeed  have  similarity  solutions  of  the  form  (5.11)  in  Exercise  5.1. 

The  scaling  of  (4.21a,  b)  yielded  fixed  constants  for  all  of  the  scaling  parameters 
and  hence  that  problem  is  not  scale  invariant.  While  such  fully  specified  problems  set 
specific  scales  for  all  variables,  appropriately  modified  versions  of  such  problems 
may  have  similarity  solutions.  Such  modifications  may  include  omitting  initial  or 
boundary  conditions,  changing  the  domain  (say  from  0  <  v  <  L  to  0  <  v  <oo),  or 
even  omitting  some  terms  from  the  governing  equation.  In  Chaps.  6  and  7,  we  will 
see  how  such  changes  can  be  motivated  by  asymptotic  analysis.  Solutions  of  such 
problems  may  approach  the  similarity  solution  in  some  limit  (of  a  variable  or  system 
parameter)  and  are  sometimes  called  asymptotically  self- similar  solutions. 


5.5  Similarity  Solutions  of  the  Heat  Equation 


The  heat  equation  (or  diffusion  equation) 


du  d2u 
dt  dx 2 ’ 


(5.15) 


is  a  classic  example  in  the  investigation  of  symmetries  in  PDEs  because  it  admits 
several  classes  of  symmetries,  each  of  which  corresponds  to  a  different  similarity 
solution. 

Applying  the  scaling  (5.1)  yields 


d2u 

dZ2' 


5.5  Similarity  Solutions  of  the  Heat  Equation 
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Note  that  U  scales  out  due  to  the  linearity  of  the  equation,  and  must  be  determined  by 
side  conditions  that  further  specify  the  problem.  Making  the  PDE  scale-invariant  fixes 
the  relationship  between  the  length-  and  time-scales  for  diffusive  solutions,  L  =  VT 
(recall  the  discussion  of  the  diffusive  timescale  in  Sect.  4.3.3).  Consequently,  we  can 
write  TJ\  =  r\  —  xt~1^2  as  the  similarity  variable  for  all  typical  problems  for  (5.15). 

In  the  context  of  the  linear  diffusion  equation,  we  will  now  consider  how  the 
choice  of  side  conditions  can  affect  the  overall  structure  of  the  similarity  solution. 


5.5.1  Source-Type  Similarity  Solutions 


Consider  (5.15)  on  —  oo  <  v  <  oo  starting  from  a  non-negative  initial  condition  at 
t  =  0  that  decays  rapidly,  u( \x\  — >  oo)  — >  0.  Assuming  no  flux  from  infinity,  the 
solution  will  maintain  its  initial  mass  (assumed  finite), 


•oo 


•oo 


u(x,t)dx  =  /  u(x,0)dx  =  M. 


-oo 


-oo 


This  integral  constraint  will  be  scale  invariant  if  UL  =  1.  Consequently,  U  =  T  ^2, 
and  772  =  ut 1//2,  yielding  the  form  of  the  similarity  solution, 


u(x,t)  =  t  l^2f(xt  ^2). 


Substituting  this  form  into  (5.15)  yields  the  ODE  for  f(r]), 

~W  +  rif')  =  f\  (5.16) 

which  can  be  re-written  as  —  =  / rr  and  integrated  by  separation  of  variables. 

The  non-negative  solution  of  this  problem  is 


fW)  = 


M 


-r]2  /4 


V4 


Tt 


and  corresponds  to  the  solution  of  the  heat  equation, 


u(x,  t)  =  Le-*2/(4f)  (5.17) 

V47Z7 

which  is  known  as  the  Cauchy  similarity  solution  or  the  source-type  similarity  solu¬ 
tion ,  based  on  its  role  as  the  solution  of  the  heat  equation  starting  from  a  point  source 
(with  strength  M)  at  the  origin  at  t  =  0  (Fig.  5.1).  The  ‘Cauchy  solution’  label  for 
(5.17)  stems  from  the  fact  that  it  solves  the  Cauchy  problem  for  the  PDE,  namely,  an 
initial  value  problem  on  the  whole  line  —  oo  <  v  <  oo. 
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Fig.  5.1  {Left)  The  Cauchy  similarity  solution  (5.17),  (Right)  The  Boltzmann  similarity  solution 
(5.20) 


Note  that  (5.15)  is  also  invariant  under  translations  in  space,  v  =  x  +  a,  and  time, 
t  =  t-\-b;  namely  u  =  u(x,  t)  is  also  a  solution  of  the  heat  equation  for  any  choice  of 
the  shifts  a  and  b.  A  consequence  is  that  (5.17)  can  be  used  to  describe  the  long-time 
behaviour  for  more  general  initial  conditions,  with  parameters  that  depend  only  on 
the  mass,  centre  of  mass  and  variance  of  the  initial  data  (see  Chap.  1 1). 


5.5.2  The  Boltzmann  Similarity  Solution 

Consider  (5. 15)  on  the  semi-infinite  domain  0  <  v  <  oo  with  the  Dirichlet  boundary 
conditions 

u(x  =  0,  t)  =  1,  u(x  —>  oo)  —>  0. 

Having  the  boundary  condition  at  x  =  0  be  scale  invariant  requires  U  =  1.  We 
therefore  find  that  TJ2  =  u  and  the  similarity  solution  takes  the  form 

u(x,  t)  =  f(xt~l/2), 


with  f(rj)  satisfying  the  ODE 

~\rif'  =  f",  (5.18) 

which  can  be  integrated  once  to  yield  a  Gaussian,  and  then  formally  again  as  the 
integral  of  the  Gaussian, 

f(ri)  =  1- erf 07/2), 
where  the  error  function  is  defined  as 

[  e~t2  dt , 


erf(z)  = 


n  Jo 


(5.19) 


5.5  Similarity  Solutions  of  the  Heat  Equation 
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yielding  the  similarity  solution 

u(x,  t)  =  1  —  erf(v/V4 1 ).  (5.20) 


5.6  A  Nonlinear  Diffusion  Equation 


Finally,  we  consider  the  similarity  solution  for  the  Cauchy  problem  for  a  nonlinear 
diffusion  equation, 


du 

Yt 


(5.21a) 


with 


ux( 0,  t)  =  0, 


(5.21b) 


This  PDE,  often  called  the  porous  medium  equation ,  models  the  spreading  of  a 
puddle  of  very  viscous  fluid  (such  as  honey)  on  a  flat  solid  surface  moving  under  the 
influence  of  gravity  [  1  ] .  Here,  u  (x ,  t )  represents  the  height  of  the  fluid  layer  at  location 
v  and  time  t;  physically  meaningful  solutions  of  this  problem  will  necessarily  have 
u(x,  t)  >  0.  The  finite-mass  similarity  solution  of  this  problem  will  have  compact 
support.  In  other  words,  it  will  be  positive  on  a  finite  domain,  —x*(t)  <  x  <  x*(t), 
with  u  =  0  for  \x\  >  x*(t)  where  v*(f)  is  called  the  interface  or  contact  line. 

Proceeding  as  before,  making  the  PDE  scale  invariant,  yields  the  relation  U3  = 
L2/T  while  the  integral  sets  UL  =  1.  This  yields  the  scalings 

|_  =  T1/5  U  =  T-1/5, 


and  determines  the  form  of  the  similarity  solution  as 

u(x,  t)  =  t~1^5  f(xt~1/5). 

Substituting  this  form  back  into  the  original  system  reduces  it  to  an  ODE  problem 
for  f(rj) 

/v* 

f  dr]  =  M,  (5.22) 

-?7* 


2 


Without  loss  of  generality,  we  assume  the  support  to  be  symmetric  relative  to  the  origin. 
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Fig.  5.2  (Left)  The  u(x,t )  lowering/spreading  self-similar  solution  profiles  (5.23)  at  several  times, 
(Right)  The  profiles  scaled  in  terms  of  tl^u  =  /  and  t~[^5x  =  r\  showing  the  similarity  function 


where  r /*  is  a  constant  to  be  determined,  related  to  the  interface  position  by  x*(t)  = 
The  general  solution  of  the  differential  equation  can  be  obtained  in  closed 
form  and,  upon  application  of  the  boundary  condition  at  rj  =  0,  reduces  to 


/ 


3 


Finally,  for  fixed  initial  mass  M,  the  integral  condition  combined  with  continuity 
of  the  solution  (i.e.  /(±r /*)  =  0)  determines  C2  and  rj *,  thereby  specifying  / 
completely  as 


u(x,  t) 


(5.23) 


see  Fig.  5.2. 


5.7  Further  Directions 

In  the  examples,  we  have  seen  how  a  PDE  problem  can  sometimes  be  reduced  to  an 
ODE  problem  through  the  determination  of  a  self- similar  solution.  In  other  problems, 
scaling  invariance  may  be  used  to  reduce  PDEs  in  terms  of  several  variables  to 
PDE  depending  on  fewer  variables,  or  to  generate  autonomous  ODEs  from  non- 
autonomous  ones. 

The  books  by  Barenblatt  [9,  10]  are  an  excellent  introduction  to  problems  (taken 
from  various  different  fields)  that  admit  similarity  solutions.  Also  see  [2,  3,  88] 
for  example.  Other  books  provide  extensive  discussion  on  constructing  similarity 
solutions  [22,  52],  and  the  theory  of  symmetries  of  PDEs  [14,  32]. 


5.8  Exercises 
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5.8  Exercises 

5.1  Show  that  the  Burgers  equation, 

ut  +  uux  =  KUXX, 

with  k  >  0  admits  the  same  form  of  similarity  solution  (5.11)  as  problem  (5.3)  and 
determine  the  ODE  for 

5.2  Find  the  similarity  solution  of  the  inviscid  Burgers  equation  (5.3a)  with  the 
integral  condition  in  (5.3b)  replaced  by 


'OO 


'1 

u  (x,  t)  dx  =  1. 


Show  that  the  same  similarity  scalings  also  apply  to  the  generalised  Burgers  equation 


( j"2), + (j»3) 


=  KU 


xx 


X 


Show  that  if  k  =  0,  this  PDE  reduces  to  (5.3a). 

5.3  Consider  the  following  problem  for  u(x,  t)  on  0  <  v  <  oo: 


du 


du 


_<j  ,.4 


- Yu —  =  t  u  , 

dt  dx 


u(0,t)  =  t3,  u(x,  0)  =  0. 


(a)  Determine  the  value  of  the  constant  a  so  that  this  problem  has  a  similarity 
solution. 

(b)  Determine  the  scales  U ,  L  in  terms  of  the  timescale  T  for  the  self-similar  solution. 

(c)  Write  u(x,  t)  for  the  similarity  solution  as  a  product  of  some  power  of  t  and  a 
similarity  function  f(rj).  State  the  ODE  for  f(rj). 

5.4  Consider  the  heat  equation  for  u(x,  t)  on  the  half-line,  0  <  x  <  oo 


du  d2u 
dt  dx2 

For  each  of  the  following  sets  of  side-conditions,  determine:  (i)  the  invariant  scalings 
U,  L  in  terms  of  T,  (ii)  the  form  of  the  self-similar  solution  u(x,  t)  in  terms  of  f(r /), 
and  (iii)  the  ODE  and  boundary  conditions  that  f(jj)  should  satisfy. 

(a)  u( 0,  t)  =  t2  and  u{x  —>  oo,  t)  —  0. 

(b)  ux(0,  t)  =  2  and  u(x  oo,  t)  =  0. 

(c)  ux( 0,  t)  =  —u2( 0,  t)  and  u(x  — >►  oo,  t)  =  0. 
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r*  oo  o 

(d)  J0  x  u  dx  =  1  and  u(x  — >►  oo,  f)  =  0. 

(e)  ^*(0,  f)  =  — w(0,  0  and  u(x  — >►  oo,  f)  =  0. 

Hint:  Show  that  this  problem  does  not  have  a  solution  following  (i,  ii),  but  can 
still  be  obtained  from  solving  an  ODE  problem. 

5.5  Determine  the  scale  factors  and  similarity  function  ODE  for  the  similarity  so¬ 
lution  u(x,  t)  on  —oo  <  v  <  oo  for 


du  d2u 

—  =  — r  T  w 
dt  dx2 


u(\x\  ->  oo)  =  0. 


5.6  Consider  the  problem  for  u(x,  t)  on  —  oo  <  v  <  oo,  see  Fig.  5.3, 


du 

Yt 


udx  ~  1. 


(a)  Apply  the  scaling  (5.1)  in  order  to  determine  the  self- similar  solution  in  the 
form  u(x,t)  =  taf(rj)  with  ij  =  x/t@  whose  amplitude  decays,  and  width 
broadens  with  increasing  time  as  shown  in  Fig.  5.3  (left).  Determine  the  ODE 
for  the  similarity  function  f(r]).  The  corresponding  PDE  solution  is  called  an 
infinite-time  defocusing  solution  since  it  spreads  indefinitely. 

(b)  Under  other  conditions,  solutions  may  have  a  critical  time  tc  such  that,  when 
the  critical  time  is  approached  the  solution  will  diverge,  u{t  /  tc)  — >  oo. 
Using  (5.1)  with  t  =  tc  +  Tt  (with  t  <  0),  determine  the  ODE  for  the  similarity 

/V 

function  f(fj).  This  is  called  3, finite-time focusing  ox  finite-time  blow-up  solution; 
it  diverges  in  amplitude  and  narrows  in  width  as  the  critical  time  is  approached. 


X  X 


Fig.  5.3  Exercise  5.6:  (Left)  infinite-time  spreading  solution,  (Right)  finite-time  blow-up  solution 
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5.7  The  solution  u(r,  t)  of  the  following  problem  describes  the  height  of  a  circular 
drop  of  fluid  spreadingon  a  dry  surface: 


du 

~dt 


on  0  <  r  <  oo  with 


du 

dr 


=  0, 

r=0 


■oo 


u(r,  t)r  dr  =  4. 


The  solution  is  positive  over  a  finite  range  0  <  r  <  r*(t)  with  u(r*(t),  t)  =  0 
defining  a  moving  “edge”  with  no  fluid  beyond  this  position.  In  other  words,  take 
the  solution  beyond  the  edge  r  =  r*(t)  to  be  zero. 

(a)  Show  that  this  problem  is  scale  invariant. 

(b)  Determine  the  ODE  for  the  similarity  function 

(c)  Determine  the  similarity  solution  u(r,t )  =  taf(r])  and  r*(t).  (Hint:  Explain 
why  Jq°°  ur  dr  =  Jq*  ur  dr) 

5.8  Consider  the  system  of  two  coupled  PDEs  for  h(x,  t)  and  u(x,  t ), 


dh  d(hu) 

—  T - 

dt  dx 


du  du 

-  U - 

dt  dx 


d3h 
dx 3 


T  h~u. 


(a)  Let  h  =  H h(x,t),  u  =  U u(x,t),  x  =  Lx,  and  t  =  T t.  Determine  scaling 
relations  for  H,  U,  L  in  terms  of  T  to  show  that  these  PDEs  are  scale  invariant. 

(b)  Let  h  =  ta  u  =  g(j])  and  r)  =  xty .  Find  a,  /3,  y  for  the  similarity 

solution  for  this  problem  and  write  the  ODEs  satisfied  by  f(rj)  and  g(r]). 

5.9  Consider  the  problem 


du  d2u 
dt  dx2 


0  <  v  <  oo, 


with  boundary  conditions 


u(x  =  0,  t)  =  0,  dxu(x  oo,  t)  =  0. 


Find  the  similarity  solution  of  this  problem  and  then  use  this  to  determine  a,  b  in  the 
far-field  growth  of  the  solution,  u(x  oo,  t)  =  atb  and  c,  d  in  the  derivative  of  the 
solution  at  the  boundary,  ux( 0,  t)  =  ctd . 

Hint:  Express  your  solution  in  terms  of  the  error  function  (5.19),  which  satisfies  the 
asymptotic  approximations 


erf(v)  ~ 


as  v 


oo, 


erf(v)  ~ 


as  v 


0. 


Chapter  6 

Perturbation  Methods 


As  we  described  in  Chap.  4,  physical  problems  can  always  be  scaled  and  restated 
as  dimensionless  models.  The  scaling  process  identifies  the  relative  importance  of 
different  physical  effects  in  terms  of  the  magnitudes  of  the  dimensionless  parameters 
that  appear.  In  the  absence  of  actual  parameter  values,  problem- specific  analytical 
and/or  numerical  methods  are  typically  necessary  to  make  progress  towards  a  general 
solution.  However,  if  any  dimensionless  parameters  are  known  to  be  relatively  large 
or  small,  then  so-called  perturbation  methods  can  often  be  employed  in  order  to 
generate  accurate  approximations  to  the  solution. 

Perturbation  methods  provide  a  systematic  approach  to  constructing  approximate 
solutions  to  equations  such  as 

_  „  dx 

F(x,e)  =  0,  G(v,  y,  e)  =  0,  —  =  H(x,s),  ...,  (6.1) 

dt 

in  the  limit  of  a  vanishing  small  perturbation  parameter ,  s  ->  0.  This  is  accomplished 
through  the  introduction  of  asymptotic  expansions ,  whereby  the  original  problem  is 
decomposed  into  an  ordered  sequence  of  simpler  sub-problems.  The  solutions  of 
the  sub-problems  are  then  recombined  to  form  an  approximate  solution  to  the  full 
original  problem.  This  may  sound  very  similar  to  superposition  principles  that  are 
often  used  to  construct  solutions  of  linear  ODE  and  PDE,  but  a  notable  difference 
here  is  that  the  original  equation  (algebraic,  ODE  or  PDE)  may  be  nonlinear. 


6.1  Asymptotic  Analysis:  Concepts  and  Notation 

Asymptotic  analysis  provides  the  mathematical  framework  that  justifies  perturbation 
methods.  The  term  asymptotic  implies  a  limit  process  and  hence  every  asymptotic 
result  must  be  given  in  terms  of  a  stated  parameter  approaching  a  limiting  value;  for 
example:  e~x  — >►  1  as  x  — >  0  or  e~x  — >  0  as  x  — >  +oo. 
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Two  functions  f(s)  and  g(£)  are  said  to  be  asymptotically  equivalent  in  the  limit 
s  — >  0,  if 

f(s) 

lim  =  1.  (6.2) 

g(e) 

This  relationship  is  often  written  more  compactly  as  “/  ~  g  as  s  — >►  0”.  It  is 
important  to  note  that  this  definition  does  not  simply  just  mean  that  /  and  g  have 
the  same  limiting  values;  for  functions  that  approach  infinity  or  vanish  in  the  limit, 
equivalence  states  that  they  have  the  same  limiting  rate  of  growth  or  decay.  For 
example,  in  the  limit  s  0 

1  1  /7r  N  1  1 

— b  1  ~  -,  tan  (4-  —  e)  ~  -,  sin(e)  ~  s,  - ~  1  +  ex, 

£  £  £  1  —  £X 

but  sin(e)  oo  £2  despite  each  vanishing  in  the  limit.  We  therefore  see  that  asymptotic 
equivalence  does  not  imply  equality  of  the  functions,  but  it  is  a  necessary  condition  for 
them  to  have  the  same  limiting  behaviour.  As  a  consequence,  asymptotic  equivalence 
is  not  a  unique  relationship  (it  effectively  defines  equivalence  classes  of  functions). 
For  example,  cos(y/?)  ~  (1  —  s/2)  ~  e£  ~  (1  +  £)  as  s  ->  0. 

A  related  description  of  asymptotic  behaviour  is  given  by  the  order  relation , 
“/  =  0(g)  as  £  ->  0”  (read  as  “/  is  big-Oh  of  g”)  defined  by 

f(s) 

lim  — -  =  A ,  (6.3) 

g(e) 

where  A  is  finite.  The  statement  /  =  O(g) ,  identifies  the  function  f(s)  as  having 
comparable  growth  or  decay  as  g(s)  as  s  — >►  0.  For  example,  sin(e)  =  0(4s)  as 
e  ->  0,  and  AH  =  0(NN+^2e~N)  as  N  — >►  oo.  The  latter  result  is  known  as 
Stirling’s  formula  [13],  a  central  result  for  estimating  large  combinatorial  values  in 
computing  and  probability  applications.  The  notation  (9(1)  is  often  used  to  describe 
expressions  having  finite  limiting  values  and  separates  quantities  that  are  singular 
(/  — >  oo  as  in  /  =  (9(£-1))  from  those  that  vanish  (/  — >  0  as  in  /  =  0(£2))  as 
£  — >  0. 

There  is  also  a  “little  oh”  relation,  “/  =  o(g)”  ase  — >►  0  (also  written  as  “/  <<C  g”) 
describing  /  being  asymptotically  smaller  than  g  if 

f(£) 

lim  =  0.  (6.4) 

g(e) 

The  6>(-)  notation  is  used  to  indicate  weak  effects  due  to  smaller  (“higher  order') 
terms  that  can  be  neglected  in  comparison  with  other  larger  terms.  Since,  for  example, 
s2  =  o(c)  we  may  then  write  2c  +  3c2  ~  2s.  The  o(0  notation  is  also  often  used 
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to  state  that  the  difference  between  two  results  is  smaller  than  a  specified  reference 
quantity,  as  in 

e£  —  ^1  +  s  +  =  o(s2)  =>-  eE  =  1  +  £  +  j£2  +  o(£2).  (6.5) 

We  note  that  our  descriptions  of  these  asymptotic  relations  in  terms  of  limits  of 
ratios  are  actually  convenient  simplifications  of  more  rigorous  definitions.  In  partic- 
ular,  /  =  0(g)  if  |/|  <  A|g|  for  all  sufficiently  small  £  [13,  72].  The  simplifications 
are  sufficient  for  most  cases,  but  can  fail  if  the  limit  is  not  defined.  For  example,  it  is 
true  that  sin(x)  =  0(1)  as  x  — >  oo  since  |  sin(x)|  is  bounded  by  one,  even  though 
is  does  not  have  a  limit.  Asymptotic  equivalence  is  more  generally  defined  along  the 
lines  of  (6.5),  in  terms  of  the  difference  between  the  functions  being  small,  /  ~  g  if 
/  -  g  =  o(g). 


6.2  Asymptotic  Expansions 

Having  established  the  order  notation,  we  can  now  describe  the  structure  of  basic 
asymptotic  expansions.  Consider  a  function  x (t,  s)  that  can  be  expanded  in  terms  of 
a  “separation  of  variables”-type  series 

x(t,  s)  =  So(s)xo(t)  +  <5i(£)*i(0  +  <$2(£)*2(0  5 - ,  (6.6) 

where  it  is  assumed  that  all  xn  =  0(1)  and  the  gauge  functions  {<$n(£)}  are  asymp¬ 
totically  ordered  in  size  as  £  — >  0  so  that 

5o(£)  <$i(£)  ^2(e)  »  •  •  •  .  (6.7) 

Equations  (6.6)  and  (6.7)  define  the  basic  form  of  an  asymptotic  expansion  (AE)  and, 
in  particular,  (6.6)  separates  the  dependence  on  the  asymptotic  parameter  £  in  each 
term  from  the  coefficients  xn  (which  may,  in  general,  also  be  functions  of  other 
independent  variables  and  parameters  in  the  model).  The  first  term  in  the  expansion 
(6.6)  is  usually  referred  to  as  the  leading  order  term ,  v  ~ 

There  are  subtle  differences  in  the  way  one  might  write  the  basic  content  of  the  first 
few  terms  of  an  asymptotic  expansion  based  on  our  previous  discussion  of  asymptotic 
ordering.  Consider 

v  —  8oxo  +  8\x\  +  82x2, 

X  =  80X0  +  <5 1*1  +  0(82 ),  (6.8) 

*  =  5o*o  +  <5 1*1  +  0(<$i)- 

The  first  expression  gives  the  first  three  terms  explicitly,  the  second  gives  two  terms 
and  an  estimate  on  the  asymptotic  order  of  the  remainder  of  the  expansion,  and  the 
third  expression  does  not  predict  82  but  just  states  that  the  omitted  contributions  are 
all  smaller  than  the  last  given  (9(<5i)  term  which  thus  effectively  provides  only  the 
information  contained  in  the  two  term  estimate  *  ~  +  <$1*1. 
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A  simple  example  of  an  asymptotic  expansion  is  provided  by  the  Taylor  series  of 
a  smooth  function  x(t)  expanded  in  a  neighbourhood  of  a  point  4 

x(t)  =  v(4)  +  gx  (4)  +  ^j£2x  (4)  +  (4)  +  *  •  •  ,  (6.9) 

where  s  =  t  —  4  (cf.  (6.6));  in  this  case,  the  small  parameter  is  e,  the  separation 
between  the  fixed  point  4  and  the  variable  t.  We  know  from  calculus  that  in  the  limit 
£  — >  0,  an  accurate  estimate  of  the  value  of  x(t)  in  a  neighbourhood  of  4  can  be 
obtained  from  a  limited  number  of  terms  in  the  expansion. 

For  some  functions,  the  asymptotic  expansion  can  be  handled  by  symbolic  algebra 
programs  (such  as  Maple  or  Mathemat  ica).  In  Maple,  for  example,  to  calculate 
the  first  six  terms  in  the  expansion  of  x{t)  =  eimt  for  t  — >  0  (here  t  is  the  small 
parameter),  the  command  is  simply 

>  series ( exp ( tan ( t )), t=0 ) ; 

1  9  lo  3  4  37  c  f. 

1  + 1  +  -r  +  -r  +  -r  h - r  +  O(r). 

2  2  8  120 

Maple  can  also  generate  asymptotic  expansions  in  other  limits,  such  as  for  x(t)  = 
1/(1  +  At)  as  t  — >  oo 


>  asympt (1/ (l+4*t) , t) ; 


1 

-  t 
4 


-l 


1 

16 


r2  + 


l 


256 


t  4  + 


1 


1024 


r 5  +  <9 


where  the  gauge  functions  are  now  inverse  powers  of  t\  this  result  can  also  be  derived 
as  a  Taylor  series  expansion  with  respect  to  the  variable  t  =  1  /s  in  the  limit  e  — >  0. 

We  note  that  asymptotic  expansions  can  take  more  complex  forms  than  (6.6).  For 
example,  the  function  x(t,  s)  =  exp(sin(s^)/s2),  has  the  asymptotic  expansion 


x  ~  e^£  ( 1  —  -st3/2  +  —s2t3  +  •  •  • 

V  6  72 

which  is  clearly  not  of  the  form  (6.6).  Our  investigations  of  solutions  to  equations 
like  (6.1)  will  be  further  complicated  by  the  fact  that  the  form  of  the  solution  is  not 
known  beforehand  and  so  we  will  need  to  find  the  gauge  functions  as  well  as  the 
coefficients. 


^  as  s  — >  0,  (6.10) 


6.2.1  Divergence  of  Asymptotic  Expansions 

Example  (6.10)  indicates  that  asymptotic  expansions  exist  even  in  cases  where  con¬ 
vergent  Taylor  series  do  not.  In  fact,  summed  over  all  terms,  asymptotic  expansions 
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can  diverge,  but,  as  we  will  shall  show,  this  does  not  prevent  partial  sums  like  (6.8) 
from  providing  accurate  results  in  the  vanishing  limit  of  the  perturbation  parameter. 

In  order  to  illustrate  the  above  points,  we  consider  an  example  of  a  function  defined 
in  terms  of  an  integral,  namely 


-t 


1  H-  £t 


dt; 


(6.11) 


such  a  definition  is  rather  common  and  includes  Fourier  and  Laplace  integrals,  as 
well  as  Mellin,  Hankel  and  many  other  integral  transformations,  and  a  number 
of  special  functions,  such  as  the  Gamma,  Beta,  Airy  and  error  functions  [11,  13, 
29,  56]. 

The  Taylor  series  (6.9)  of  the  integrand  in  the  limit  £  ^  0  leads  to  the  asymptotic 
expansion 

oo 

/(e)  ~  1  -  e  +  2e2  -  6e3  +  •  •  •  =  ^(-l)"n!e".  (6.12) 

n= 0 


The  standard  ratio  test  for  power  series  requires  (n  +  l)e  <  1 ,  so  we  find  that  the  series 
has  a  zero  radius  of  convergence  (i.e.  it  diverges  for  all  \s\  >  0  at  sufficiently  large 
values  of  n).  Furthermore,  convergence  for  s  in  an  interval  around  zero,  —  8  <  s  <  8, 
should  not  have  been  expected  since  for  s  <  0  the  singularity  at  t  =  —  1  /e  is  not 
integrable.  Despite  these  issues,  Fig.  6. 1  shows  that  (6.12)  truncated  at  a  finite  number 
of  terms  provides  a  good  estimate  of  the  value  of  the  integral  as  s  — >  0+ . 

To  understand  this  behaviour,  consider  (6.11)  with  s  =  the  integral  can  be 
numerically  evaluated  to  give  /(0.1)  ~  0.915633.  For  (6.12),  the  terms  in  the  ex¬ 
pansion  remain  asymptotically  ordered  while  (n  +  l)e  <  1,  namely  for  n  <  10. 
Recalling  that  the  first  term  neglected  in  the  asymptotic  expansion  gives  an  estimate 
of  the  error  (cf.  (6.8)),  we  see  that  truncating  the  expansion  up  to  and  including  ten 
terms  will  give  a  decreasing  magnitude  for  the  error,  but  going  beyond  ten  will  lead 


Fig.  6.1  (Left)  The  magnitude  of  term  an  in  (6.12)  for  s  =  0.1.  For  n  <  10,  the  terms  decay  in 
size,  but  thereafter  start  to  grow.  (Right)  I (s)  from  (6.11)  compared  with  the  partial  sums,  /#(e), 
of  series  (6.12)  showing  the  error  growing  for  s  >  0.1  for  above  the  optimal  truncation 
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to  the  error  increasing  in  size  (see  Fig.  6.1).  This  is  in  stark  contrast  to  convergent 
series,  where  retaining  more  terms  always  reduces  the  error. 

To  summarise,  asymptotic  expansions  are  always  accurate  in  the  limit  e  — >  0,  but 
for  finite  e,  there  will  be  an  optimal  truncation  n  =  0,  1,  ... ,  N(s),  that  minimises 
the  error.  In  most  cases,  however,  even  just  the  first  few  terms  from  the  asymptotic 
expansion  can  yield  an  excellent  approximation  to  the  true  solution,  as  illustrated 
by  /4(e)  in  Fig.  6.1.  Complications  stemming  from  attempting  to  interchange  limits 
(e  — >  0  and  n  — >►  00)  will  appear  in  several  situations  and  often  signal  subtle  changes 
of  behaviour  in  the  asymptotic  expansion. 


6.3  The  Calculation  of  Asymptotic  Expansions 

Rather  than  calculating  asymptotic  expansions  of  given  functions,  we  are  usually 
more  interested  in  constructing  an  asymptotic  expansion  of  solutions  to  problems  of 
forms  like  (6.1). 

We  begin  by  calculating  asymptotic  expansions  of  solutions  to  algebraic  equa¬ 
tions,  for  which  the  coefficients  {xn}n=0,l,2,...  in  (6.6)  are  necessarily  constants.  It 
will  be  useful  to  classify  solutions  (and  the  corresponding  asymptotic  expansions) 
in  the  limit  s  — >  0  into  the  following  types: 

•  Regular  solutions ,  which  have  finite  limits:  lim^o*  =  *0  with  the  leading  order 
gauge  function  in  (6.6)  being  <$o  =  1. 

•  Vanishing  solutions ,  which  are  regular  solutions  with  a  zero  limit,  i.e.  lim^o*  =  0 

with  the  leading  order  gauge  function  satisfying  60(e)  1. 

•  Singular  solutions,  which  have  divergent  limits:  lim^oW  =  00,  corresponding 

to  a  finite  vo  with  a  singular  60(e)  1. 

We  now  introduce  two  approaches  for  constructing  solutions  in  the  form  of  as¬ 
ymptotic  expansions  known  as  the  expansion  method  and  the  iteration  method.  We 
will  illustrate  both  methods  applied  to  a  simple  example  for  which  the  solutions  can 
be  found  explicitly. 

Consider  the  quadratic  equation 

x2  —  x  +  =  0  as  e  — >  0,  (6.13) 

its  exact  solutions  being 

1  ± 


%A,B  = 


2 


(6.14) 
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Employing  the  generalised  binomial  expansion  for  f(z)  =  (1  +  z)r,  which  can  be 
derived  by  application  of  Taylor  series, 


(1  +  z)r  ~  1  +  rz  + 


r(r  — 1)  2  i  r(r  — l)(r— 2)  3 


■Z  + 


3! 


Z"  + 


as  z 


0, 


(6.15) 


with  r  =  i,  z  =  —  £,  allows  us  to  write  the  two  solutions  as 


xa  —  l  —  jfiS2  +  •  •  •  —  0(1), 

Xb  =  0  +  +  -j^£2  +  •  •  •  = 


(6.16) 


We  note  that  taking  g  «  1  directly  in  (6.13)  gives  x2  —  x  ~  0  yielding  approximate 
solutions  v  ~  0,  1.  In  other  words,  the  balance  of  the  first  two  terms  in  (6.13) 
is  important  in  locating  the  roots,  with  the  third  term  only  slightly  adjusting  their 
values  (cf.  (6.16)).  This  is  the  hallmark  of  a  regular  perturbation  problem ,  for  which 
the  limit  s  — >  0  only  yields  regular  (i.e.  non-singular)  solutions  with  well-defined 
limits  for  s  — >  0. 

We  will  begin  our  analysis  with  the  more  concise  ‘expansion  method’,  which 
benefits  from  an  a  priori  assumption  on  the  form  for  the  asymptotic  expansion  being 
sought. 


6.3.1  The  Expansion  Method 

In  order  to  solve  (6.13)  without  explicit  reference  to  the  exact  solutions,  we  assume 
the  solutions  are  regular  and  have  8n  =  sn  for  n  =  0,  1,2,...  (i.e.  the  sequence  of 
gauge  functions  {£„(£)}  is  already  known),  so  that 

a 

X  —  Vo  +  £X\  +  £  X2  "T  •  •  •  .  (6.17) 

This  is  a  commonly  occurring  asymptotic  expansion  for  regular  solutions,  which  can 
be  thought  of  as  being  a  Taylor  series  expansion  of  the  solution  with  respect  to  the 
parameter  s  around  £  =  0. 

Substituting  (6.17)  into  Eq.  (6.13)  yields 

(Vo  +  £X\  +  £2X2  +  •  •  •  )^  —  (Vo  +  £X\  +  £2 X2  +  •  •  •  )  +  ~£  =  0, 

and  ordering  terms  in  powers  of  £  yields 

(xq  —  +  £  ^2x\xq  —  x\  +  — ^  +  s2 (x2  +  2xqX2  ~  X2)  +  •  •  •  =  0  +  £0  +  s2 0  +  •  •  •  . 
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Assuming  that  the  coefficients  are  0(1),  requiring  both  sides  of  the  equation  to 
balance  leads  to  the  system  of  equations 


0(e°)  :  Xq  —  xo  =  0  =4> 

0(sl)  :  2xixo  -  xi  +  \  =  0  => 

0(s2)  :  x\  +  2xqX2  —  X2  =  0  => 


X()  =  1  or  X()  =  0, 

Xi  =  —1/4  |  xi  =  1/4,  (6.18) 

*2  =  -1/16  |  X2  =  1/16, 


and  so  on  for  higher  orders.  Note  that  only  the  leading  order  equation  is  nonlinear; 
subsequent  corrections  depend  on  which  of  the  leading  order  solutions  is  considered. 
Comparing  the  coefficients  resulting  from  (6. 1 8)  with  (6. 1 6),  we  observe  that  we  have 
obtained  the  asymptotic  expansions  for  both  solutions  of  (6.13). 


6.3.2  The  Iteration  Method 

In  contrast  to  the  expansion  method,  we  no  longer  assume  a  form  for  the  entire 
asymptotic  expansion,  but  instead  only  look  (initially)  at  the  leading  order  term 
x  ~  xo<$o(£),  with  both  xo  and  8o(s)  to  be  determined.  There  are  two  fundamental 
assumptions  made  in  relation  to  the  leading  order  term: 

(i)  For  every  nontrivial  solution  (i.e.  not  all  xn  =  0),  the  leading  order  term  must 
be  nontrivial:  xo  /  0  and  7^  0.1 

(ii)  The  leading  order  coefficient  is  finite,  xo  =  6>(l)as£^0  (i.e.  we  are  looking 
for  regular  solutions;  we  tackle  singular  solutions  later). 

Substituting  the  leading  order  term  into  (6.13)  yields 

Xq5q  —  xo<5()  +  =0.  (6.19) 

(l)  (2) 

In  order  for  this  equation  to  hold  as  s  — >  0,  at  least  two  of  the  terms  must  balance  in 
asymptotic  scales,  with  the  remaining  terms  being  sub-dominant  (i.e.  asymptotically 
smaller  than  the  retained  terms  as  s  0).  The  smaller  terms  can  then  be  neglected 
in  determining  the  leading  order  solution.  The  set  of  dominant  terms  that  balance  to 
yield  the  leading  order  solutions  are  called  the  dominant  terms  and  this  argument  is 
referred  to  as  the  principle  of  dominant  balance. 

Ignoring  all  0(1)  coefficients  for  the  time-being,  consider  the  three  possibilities 
for  balancing  the  asymptotic  gauges  of  the  potential  dominant  terms  in  (6.19): 

(a)  Terms  (1,  2):  <$q  =  <$o  =4>  <$o  =  1 

(b)  Terms  (1, 3):  5q  =  £  =>  <$o  =  (6.20) 

(c)  Terms  (2,  3):  <$o  =  £  =>  =  £ 


trivial  solutions  (x  =  0)  are  exact  solutions  only  if  they  satisfy  the  full  problem  for  all  s. 
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In  case  (a),  60  =  0  is  excluded  by  the  first  fundamental  assumption  (that  6o  should 
be  non-zero).  It  is  also  clear  that  for  this  problem,  all  three  terms  cannot  be  of  the 
same  asymptotic  order. 

For  each  of  the  proposed  balances,  we  must  now  attempt  to  verify  the  principle 
of  dominant  balance.  Namely,  it  is  essential  to  confirm  that  all  sub-dominant  terms 
which  have  been  omitted  from  the  dominant  balance  are  indeed  smaller  than  the 
dominant  terms.  When  this  occurs,  the  balance  is  called  a  distinguished  limit. 

For  the  three  possible  balances  in  (6.20)  we  see  that 

(a)  Terms  (1,  2)  :  6q  =  6o  =  0(1)  Term  (3)  :  epsilon  =  0(s) 

(b)  Terms  (1,  3)  :  6q  =  s  =  0(e)  Term  (2)  :  6o  =  0(yfe) 

(c)  Terms  (2,  3)  :  6o  =  £  =  0{s)  Term  (1)  :  6q  =  0(s 2) 

In  case  (b),  we  see  that  the  second  term  is  not  sub-dominant  to  the  first  and  third 
terms,  and  so  is  not  a  valid  balance.  This  leaves  cases  (a)  and  (c)  as  valid  dominant 
balances  leading  to  distinguished  limits. 

The  coefficients  in  each  case  can  now  be  obtained: 

0  1 

(a)  <5o  =  1:  Eq.  (6.19)  becomes  Xq  —  xo  +  =  0.  Thus,  as  s  ->  0,  the  leading  order 

dominant  balance  determines  the  leading  order  coefficient  from 

Vq  —  X()  =  0  =>-  *0  =  1, 

where  we  reject  the  root  xo  =  0  in  line  with  assumption  (i)  above.  We  see  that 
*0  =  1  corresponds  to  the  first  term  in  xa  from  (6.16). 

(c)  5o  =  £•  Eq.  (6.19)  takes  the  form  sx q  —  xo  +  \  =0  (where  we  have  divided 
through  by  the  common  factor  s).  The  rescaled  leading  order  equation  yields 
xo  =  1/4,  which  together  with  8o  =  £  determines  the  first  term  in  xB  ~ 
from  (6.16). 

Further  terms  in  the  expansion  of  each  solution  can  be  obtained  by  repeating  the 
process:  determining  the  distinguished  limits  for  the  gauge  functions  8i(s)  and  the 
values  for  the  coefficients  x*  using 


xa  ~  1  +  8\ax\a  and  xB  ~  \e  +  615x15, 

Xa  ~  1  +  &1AX1A  +  &2AX2A  and  XB  ~  \s  +  615X15  +  &2AX2A, 

and  so  on,  substituting  into  (6.13)  at  each  step  and  requiring  that  the  gauge  functions 
be  asymptotically  ordered,  i.e.  1  8\a  ^  62 a  and  s  615  625.  In  this  manner, 

we  can  obtain  the  expansions  of  xa  and  xB  from  (6.16). 

Obviously,  as  more  terms  are  retained  in  the  expansion,  the  determination  of  the 
dominant  balance  requires  an  increasing  amount  of  work  and  this  can  be  a  substantial 
drawback  of  the  iterative  approach.  Consequently,  the  expansion  method  is  often 
favoured  in  obtaining  a  rapid  result.  However,  while  both  roots  for  this  example 
could  be  obtained  using  the  expansion  (6.17),  we  will  encounter  many  situations  for 
which  the  principle  of  dominant  balance  will  be  particularly  useful. 
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6.3.3  Further  Examples 

Suppose  we  wish  to  find  the  roots  of  the  transcendental  equation 

x2  —  2x  +  s  sinv  =  0  as  s  — >  0.  (6.21) 

This  equation  does  not  admit  explicit  expressions  for  all  roots,  but  it  can  be  seen  that 
x  =  0  is  a  solution  for  any  e.  Setting  s  =  0  in  (6.21)  gives  the  leading  order  equation 

Xq  —  2xo  =  0  =>-  jc()  =  0  or  Vo  =  2.  (6.22) 

We  will  focus  on  the  expansion  for  the  nontrivial  root,  see  Fig.  6.2. 

While  we  may  again  proceed  by  hand,  we  take  this  opportunity  to  demonstrate  how 
symbolic  algebra  software  can  be  employed  to  perform  some  of  the  computationally 
intensive  calculations.  Using  Maple,  we  write 

>  eps : =epsilon; 

eps  :=  s 

>  eq: =x"2-2 *x+eps*sin (x) ; 

eq  :=  x  —  2x  +  s  sin  (x) 

>  x: =x0+eps*xl+eps"2*x2+eps~3*x3 ; 

x  :=  xO  +  s  xl  +  s2x2  +  s3x3 

>  eqser : =series ( eq, eps=0 , 3 ) ; 

eqser  :=  xO2  —  2x0  +  (2x0x1  —  2x1  +  sin  (xO))  e 

+  ( 2x0x2  +x72  —  2x2  +  cos  (. x0)xl )  s2  +  O  (e3) 

>  orderO : =coeff ( eqser , eps , 0 ) ; 

orderO  \=  xO2  —2x0 

>  orderl : =coeff ( eqser , eps , 1 ) ; 

order  1  :=  2x0x1  —  2x1  +  sin  (xO) 

>  order2 : =coef f ( eqser , eps , 2 ) ; 

order2  \=  2x0x2  +  xl 2  —  2x2  +  cos  ( x0)xl 


Fig.  6.2  Convergence  of  the 
function 

f(x,  s)  =  x2  —  2x  +  e  sinx 
for  s  =  2 ,  \ |  to  the  e  =  0 
limit 


0 


x 
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The  series  ( )  command  performs  a  Taylor  series  expansion  on  the  equation.  The 
order 0  equation  is  simply  our  leading  order  problem  (6.22).  Separating  out  the 
coefficient  of  0(sl)  and  substituting  the  value  vo  =  2  into  orderl  yields  the 
value  for  x\  =  —  \  sin  2.  Continuing  on,  the  order  2  equation  gives  X2  =  \  sin  2  — 

^  sin2  2.  The  expansion  up  to  0(s 3)  for  the  nontrivial  root  is  therefore 


(sin2)£  + 


-  sin2  2^  s 2 

8  / 


+  0(£3). 


To  recap,  when  the  expansion  method  works,  it  is  a  very  straightforward  approach 
to  carry  out,  either  by  hand  or  with  the  aid  of  computer  software.  The  limitation  of 
the  method  is  that  it  will  fail  to  construct  solutions  that  are  not  of  the  form  assumed 
for  the  asymptotic  expansion.  The  trial  solution  need  not  be  (6.17)  (experience  can 
provide  good  guesses),  but  the  choice  of  the  gauge  functions  8n  must  be  known  before 
this  method  can  be  used. 

Our  next  example  introduces  a  problem  for  which  the  expansion  method  using 
the  asymptotic  expansion  (6.17)  fails,  and  consequently  illustrates  the  strengths  of 
the  iteration  method.  Consider  the  s  — >  0  limit  of  the  equation 

(x  -  l)2  -  9s  =  0.  (6.23) 

Substituting  the  expansion  (6.17)  into  Eq.  (6.23)  leads  to  the  system  of  equations 

0(e°)  :  (vo  —  1  )2  =  0  =>  vo  =  1  or  vo  =  1, 

0(sl)  :  2xi(x0-  1)  -  9  =  0  =>■  -9  =  0  (x?) 

0(s2)  :  x\  +  2(xo  -  l)x2  =  0 


Substituting  the  leading  order  double  root  into  the  O  (s)  equation  has  yielded  —9  =  0, 
a  clear  contradiction.  This  indicates  that  our  choice  for  the  expansion  of  v  was 
incorrect,  namely  that  the  solutions  of  (6.23)  are  not  of  the  assumed  form  (6.17).  In 
fact,  as  is  trivial  to  verify,  the  exact  solutions  are  v  =  1  db  3^/s. 

Applying  the  iteration  method  to  (6.23)  with  v  ~  vo<$o(£)  +  vi<$i(e),  we  would 
find  directly  that  the  first  two  gauge  functions  are  given  by  <$o  —  1  and  8\  =  and 
the  coefficients  are  vo  =  1,1  and  x\  =  ±3;  looking  for  higher-order  terms  would 
yield  coefficients  v*  =  0  for  i  >2. 

Problems  having  degenerate  (repeated)  leading  order  roots  should  always  be 
treated  with  caution  regarding  their  expansions.  There  are  also  many  problems  re¬ 
quiring  non-algebraic  gauge  functions  (<$(£)  ^  sa)\  such  problems  frequently  arise 
from  transcendental  equations  (see  [47,  49]). 
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6.4  A  Regular  Expansion  for  a  Solution  of  an  ODE  Problem 


The  expansion  and  iteration  methods  also  extend  to  solving  ODE  and  PDE  problems, 
the  only  major  difference  being  that  at  each  order  we  will  have  to  determine  a  function 
rather  than  just  a  constant  coefficient. 

We  now  illustrate  the  application  of  the  expansion  method  to  find  the  solution  of 
an  ordinary  differential  equation  using  the  projectile  problem  (4.7a)  introduced  in 
Chap.  4, 


d2x  1 

dt2  (1  +  sx)2  ’ 


*(0)  =  1, 


(6.24) 


in  the  limit  s  — >  0.  We  begin  by  substituting  the  asymptotic  expansion  x(t)  ~ 
xo(t)  +  sx\(t)  +  £2X2 (t)  +  •  •  •  into  the  ODE  and  initial  conditions.  The  expansion 
for  the  left-hand  side  of  the  ODE  is  simply 


~^~2  —  “I”  £x\(t)  +  £2X2  (t)  +  •  •  • 


and  using  the  binomial  expansion  (6.15)  with  r  =  —2,  z  =  sx,  the  right-hand  side 
becomes 

- 7T  =  —  1  +  2£x  —  3£2x2  +  4£3x3  +  •  •  •  (6.25) 

(1  +  £X)2 


In  fact,  this  needs  to  be  further  expanded  using  the  asymptotic  expansion  for  x(t)  to 
yield 

=  —  1  +  2£(xq  +  £X\  +  •  •  •  )  —  3£2(xq  +  £X\  +  •  •  *  )2  +  •  *  • 

—  — 1  2£xq  4“  £2 (2x\  —  3vq)  H-  £3 (2x2  —  6xqx\  4“  •  •  • )  H-  D(s^). 


The  initial  conditions  provide  initial  conditions  on  the  coefficient  functions  xn(t)  of 
the  asymptotic  expansion  through  a  comparison  at  each  order  in  £ 

x(0)  =  1  =>-  *o(0)  +  £Vi(0)  +  £2V2(0)  +  •  •  •  =  1  +  £0  +  £20  +  •  •  •  , 

X^O)  =  a  =>►  Vq(0)  +  £vj(0)  +  £2V2(0)  +  •  •  •  =  Of  +  £0  +  £20  +  •  •  •  . 

We  can  now  separate  the  original  ODE  problem  (6.24)  into  sub-problems  for  each 
xn(t)  at  0(£n) 


O(s0)  : 

x”  —  -  1 

x0(0) 

=  1 

Xq(0)  =  a. 

Ois1)  : 

x'[  —  2  vo 

xx  (0) 

=  0 

V1(0)  =  0, 

0(e2)  : 

*2  =  2vi  —  3vq 

x2  (0) 

=  0 

*£(0)  =  0, 

and  so  on.  The  solution  to  the  higher  order  problems  depends  on  the  solutions  from 
the  lower  order  ones  and  so  we  must  solve  the  sub-problems  in  sequence.  Solving  the 
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0(6°)  problem  yields  the  leading  order  solution,  vo(0  =  —  \t2  +at  + 1.  Substituting 
xo  into  the  O  (s 1 )  equation  we  obtain, 

x ^  —  — t2  4-  2a  t  H-  2  y  x\(t )  —  — "F  2*^  H- t2 

and  results  at  higher  orders  follow  analogously.  Reassembling  the  asymptotic  expan¬ 
sion  for  the  solution  gives 

x(t)  =  [~\t2  +  at  +  l)  +  £  [~j2^  +  f  f3  +  t2)  +  0(s2).  (6.26) 

Physically,  the  flight  of  the  projectile  ends  when  the  ground  is  reached  (0  <  t  <  t* 
for  v  (4)  =  0),  but  mathematically,  there  is  nothing  stopping  us  from  considering  the 
behaviour  predicted  by  (6.26)  for  longer  times.  Note  that  the  asymptotic  ordering  of 
the  expansion  breaks  down  with  0(x o)  =  0(sx i)  =  0(\/s)  when  t  =  0(1/ *Js) 
and  the  construction  of  the  solution  would  also  break  down  at  this  point,  since  the 
assumption  that  \sx\  |  <<C  l-^ol  implicit  in  (6.25)  would  be  violated.  This  is  a  common 
occurrence  for  asymptotic  expansions  and  we  will  also  see  examples  of  systems  in 
Chaps.  7  and  8  where  such  difficulties  arise.  Such  a  breakdown  indicates  a  transition 
in  scaling  regimes  and  the  resolution  of  the  problem  involves  identification  of  the 
appropriate  new  scaling.  In  the  next  section,  we  will  consider  this  issue  further  in  the 
context  of  determining  solutions  to  singular  perturbation  problems. 


6.5  Singular  Perturbation  Problems 

Problems  having  one  or  more  solutions  that  exhibit  singular  (divergent)  behaviour 
in  the  limit  s  — >  0  are  called  singular  perturbation  problems.  The  singular  solutions 
are  not  obtainable  from  asymptotic  expansions  appropriate  to  regular  solutions,  such 
as  (6.17);  attempts  to  use  such  asymptotic  expansions  will  return  only  a  subset  of  the 
solutions  of  the  full  problem,  or  at  worst,  no  solutions  at  all  (see  Exercise  6.3). 

This  behaviour  is  inherent  to  all  classes  of  singular  perturbation  problems: 

•  For  algebraic  equations,  a  singularly  perturbed  Nth  degree  polynomial  will  only 
have  M  regular  solutions,  with  M  <  N  if  the  leading  order  equation  reduces  to  an 
Mth  degree  polynomial  for  vo  due  to  terms  of  the  form  spxK  (p  >  0)  vanishing 
for  M  <  K  <  N. 

•  For  differential  equations,  if  the  limit  e  — >  0  causes  the  highest  order  derivative  to 
vanish  (e.g.  spdN x/dtN),  the  leading  order  solution  of  the  remaining  lower  order 
problem  will  typically  not  have  enough  degrees  of  freedom  to  satisfy  all  of  the 
initial  or  boundary  conditions  imposed  on  the  original  problem. 
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•  Likewise,  in  other  classes  of  singular  problems,  the  s  =  0  leading  order  problem 
may  be  dramatically  reduced  from  the  full  problem  for  s  >  0,  for  example: 
a  singular  PDE  reducing  to  an  ODE,  a  singular  ODE  reducing  to  an  algebraic 
equation,  or  a  system  of  equations  reducing  to  a  single  equation. 

The  non-regular  solutions  are  not  truly  lost  by  such  reductions,  and  can  be  recovered 
through  appropriate  rescalings  of  the  original  problem.  Which  solutions  are  obtained 
depends  on  the  scaling  of  the  problem  and  the  form  of  the  asymptotic  expansion 
assumed  for  the  solution. 

As  an  example,  consider  the  s  — >  0  limit  of  the  (immediately  solvable)  algebraic 
equation 

sx 2  —  2x  +  1  =  0.  (6.27) 

Substituting  s  =  0  into  (6.27)  gives 


—  2x  +  1  =  0  =>  x  =  \- 


(6.28) 


This  leading  order  equation  therefore  only  yields  one  of  the  two  roots  expected  from 
the  second-degree  equation  (6.27).  The  exact  solutions  of  (6.27)  can  be  written  as 


v  = 


1  ±  VT~^e  1  ±  (1  -  \s  -  ±e2  +  <9(e3)) 


from  which  we  see  that 


XA 


1  £  1  £  £2 

—  —  — ,  x  r  ^  —  T  —  T  — 
2  8  2  8  16 


(6.29) 


(6.30) 


Equation  (6.28)  produced  only  the  leading  order  term  of  the  regular  solution  x#. 
The  xa  solution  diverges  as  s  — >  0  and  cannot  be  expressed  in  terms  of  the  regular 
expansion  (6.17). 

Substituting  the  leading  term  from  each  of  these  solutions  back  into  (6.27)  helps 
identify  the  cause  of  the  discrepancy 


xa 


*(i)2-2(i)  +  l=0,  e(ir-2(i)  +  l=0. 


The  solutions  are  determined  by  different  dominant  balances  in  Eq.  (6.27).  For  xa  ,  the 
first  two  terms  balance  at  0(  1/s)  while  the  third  term  is  sub-dominant  at  0(1).  For 
xb,  the  second  and  third  terms  balance  at  O  ( 1 )  with  the  first  term  being  sub-dominant 
at  O(s)  (this  being  the  regular  solution  that  was  obtained  from  (6.28)). 
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6.5.1  Rescaling  to  Obtain  Singular  Solutions 

We  now  outline  a  systematic  procedure  for  how  the  previously  introduced  methods 
for  regular  perturbation  problems  can  be  extended  to  handle  singular  solutions: 

(1)  Substitute  v  =  <$o(£)X  into  the  given  problem. 

(2)  Choose  <$o(s)  so  as  to  produce  a  consistent  dominant  balance;  verify  that  all 
neglected  terms  are  indeed  sub-dominant.  Different  <$o(£)’s  lead  to  different 
dominant  balances,  and  considering  all  of  the  possible  choices  will  yield  all  of 
the  regular  and  singular  solutions. 

(3)  Factor  out  any  common  e-scalings  to  yield  a  regular  perturbation  problem  in 
X.  Then  solve  this  problem  using  either  the  iteration  or  expansion  methods  to 
generate  regular  solutions  corresponding  to  this  distinguished  limit. 

(4)  Rescale  X  by  <$o  1°  obtain  v  in  final  form. 

We  will  illustrate  this  methodology  on  the  example  from  the  previous  section. 
Substituting  v  =  <5ovo  into  (6.27)  yields 

s8%X2-2S0X+  1  =0.  (6.31) 

'~7i T'  "7C  (3) 

We  compare  the  orders  of  magnitude  of  the  terms  (cf.  Sect.  6.3.2).  There  are  three 
possible  balances: 

(a)  Terms  (2,  3):  <$o  =  1  =4>  <$o  =  1 

(b)  Terms  (1,  3):  s8 q  =  1  =>  <$o  =  1  /  y/s 

(c)  Terms  (1,  2):  s8 q  =  <$o  =>  <$o  =  1/e 

In  case  (a),  the  omitted  first  term  is  sub-dominant,  O(s),  and  so  this  is  a  consistent 

balance;  this  balance  yields  the  regular  solution,  xb  .  In  case  (b),  the  dominant  balance 
is  at  (9(1),  while  the  omitted  second  term  is  0(1/ ^/s)  1  in  the  limit  s  — >  0  and  is 

not  sub-dominant.  Hence  this  balance  is  inconsistent  and  must  be  rejected.  Finally, 
in  case  (c)  the  balancing  terms  are  0(  l/s),  while  the  third  term  is  0(1)  <<C  1/s , 
therefore  yielding  a  second  consistent  dominant  balance;  this  case  yields  the  singular 
solution. 

In  order  to  investigate  the  singular  solution,  we  write  x  =  X/s  and  substitute  into 
(6.27)  to  arrive  at  the  (rescaled)  regular  problem 

X2  -  2X  +  s  =  0,  (6.32) 

where  we  have  multiplied  through  by  s.  Expanding  X  (s)  as  a  regular  expansion  (here 
X  ~  2f()  T  sX\  +  £2X2  will  work)  and  seeking  the  leading  order  term  yields 

xl  -  2X0  =  0, 


(6.33) 
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which  has  solutions  Xo  =  0  and  Xo  =  2.  We  ignore  the  root  Xo  =  0  because  it 
is  not  a  nontrivial  leading  order  term/  The  other  root  yields  the  singular  solution 

xa  ~  2/s. 

A  particular  difference  between  the  iteration  method  described  in  Sect.  6.3.2  and 
the  current  methodology  is  worth  noting.  Both  seek  to  determine  the  leading  order 
gauge  function  <$o,  and  the  set  of  coefficients  xn  versus  Xn  will  be  identical.  The 
iteration  method  seeks  to  identify  one  successive  term  in  the  asymptotic  expansion 
at  each  iteration.  In  contrast,  the  rescaling  approach  re-casts  the  entire  problem  into 
a  new  (regular  perturbation)  form  for  X(s)  =  0(1)  (as  in  (6.32));  whether  to  then 
use  the  expansion  method  or  iteration  method  to  solve  for  X  is  left  as  a  separate 
decision.  In  terms  of  the  asymptotic  expansion  of  the  solution,  we  have 

V  ~  <$o2f(£) 

~  <$0  (xo  +  8\X\  +  <$2X2  +  <$3^3  +  *  *  *  ^ 

~  <$ovo  +  <$i*i  +  +  <$3*3  H - ,  (6.34) 

so  that  the  leading  order  gauge  function  <$o  scales  through  the  8  gauge  functions  in 
the  expansion  of  the  rescaled  solution. 

Singular  problems  for  ODE  and  PDE  introduce  additional  complexities  and  are  of 
particular  interest  as  they  frequently  arise  in  applications.  The  following  two  chapters 
consider  these  scenarios  in  detail. 


6.6  Further  Directions 

There  are  many  variations  of  the  methods  used  in  this  chapter.  Reference  [47]  employs 
an  iterative  procedure,  where  the  original  equation  must  be  written  in  a  form  that 
is  compatible  with  the  contraction  mapping  theorem  and  allows  for  greater  analysis 
of  the  convergence  of  the  asymptotic  expansion.  The  method  described  in  [49]  is 
somewhat  more  similar  to  that  presented  here;  there,  the  gauge  functions  are  assumed 
to  be  of  the  form  8n  =  san  where  an  need  not  be  an  integer.  Finally,  we  note 
that  many  further  approaches  for  constructing  asymptotic  expansions  for  integrals 
and  differential  equations  build  directly  on  the  perturbation  methods  for  algebraic 
equations  described  in  this  chapter  [11,  13,  29,  47,  72,  92]. 


2It  is  a  “ghost”  of  the  xb  regular  solution,  which  is  0(e)  in  terms  of  X  (and  violates  requirement 
(i)  in  Sect.  6.3.2). 
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6.7  Exercises 


6.1  Use  Taylor  series  expansions  for  sin  y  and  ey  for  y  0  and  the  basic  property 
exp(Z  k  at)  =  Uk  eak  to  derive  (6.10)  for  £  ->  0  from  x(t,  s)  =  exp  (sin  (£^/t)/£2) 
when  t  =  0(1),  but  v  ~  1  +  \[ef  +  \sT  when  t  —  £3T  with  T  =  0(1). 

6.2  Consider  the  limit  of  e  — >  0  for  the  equation 

(x  -  3)3  =  24sx2. 

Solve  by  iteration3  to  determine  v  ~  So(£)xo  +  5i(£)vi  +  S 2(s)x2. 

6.3  Consider  the  algebraic  equation  for  s  — >  0 

s6x3  —  5s3  x2  —  20£v  +  60  =  0. 


Show  that  there  are  no  regular  solutions  and  determine  the  leading  order  nontrivial 
term  in  the  expansion  of  each  of  the  three  solutions. 

6.4  Consider  the  projectile  problem  for  s  — >  0 


d2x  1 

dt 2  (1  +  sx )2  ’ 


jc(0)  =  1,  x'(0)  =  3£. 


(a)  Let  tmax  be  the  time  when  the  projectile  reaches  its  maximum  height.  How  many 
terms  in  the  expansion  of  x(t)  will  you  need  to  determine  £max  =  to  +  £t\  + 
0(£2)1 

(b)  Show  that  while  (6.26)  with  a  =  3£  could  have  been  used  to  determine  the 
solution  in  part  (a),  this  could  not  be  used  with  the  initial  condition  v^O)  =  4/e. 
Assume  a  singular  form  x(t)  =  X  (t) /e  ~  Xq  (t) /e  +  X\  (t)  to  find  this  solution. 

6.5  Consider  the  problem 

- b  £v2  +  t  =  0,  v(0)  =  0,  £  —>  0. 

dt 


(a)  Find  the  first  three  terms  in  the  expansion  of  the  solution,  v(t)  ~  vo(0  +  evi  (t)  + 
£2V2  (0- 

(b)  Determine  the  range  of  times,  0  <  t  <  0(£a),  for  which  the  terms  in  the 

expansion  retain  asymptotic  ordering,  i.e.  vq  £V\  £2V2  . 


3Use  of  a  computer  algebra  program  (Maple  or  Mathematica)  is  recommended  for  solving 

many  of  the  more  algebraically  intensive  problems. 
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6  Perturbation  Methods 


6.6  Consider  the  system  of  equations 


ex  -  y  =  1 
e2x  +  y  =  4 


in  the  limit  s  0. 

Explain  why  setting  s  =  0  does  not  lead  to  an  acceptable  leading  order  solution. 
Determine  the  first  two  terms  in  the  expansions  of  x(e),  y(£). 

6.7  Use  the  iteration  method  with  v  ~  5o(£)*o  +  £i(£)*i  1°  find  the  singular  real 
solution  of  the  equation 

2x2e~5x  =  8e,  s  — >  0. 


Note:  This  equation  has  three  real  roots;  the  two  vanishing  roots  are  not  of  interest 
here  (they  are  v  ~  ±2s1/2  +  10s).  The  one  you  should  locate  has  <$o(e)  — >  oo  (i.e. 
it  is  large  and  positive). 

Hint:  Take  the  logarithm  of  both  sides  of  the  equation  before  beginning  iteration. 

6.8  Re-consider  the  solutions  of  Eq.  (6.21),  but  now  in  the  limit  of  s  — >  oo:  let 
2  =  1  / s  and  find  the  real- valued  solutions  of 

sinv  +  s(x2  —  2x)  =  0,  2^0. 


Assume  v  ~  vo  +  ex\  +  s2x 2.  Note  that  for  2  =  0  the  leading  order  problem  has 
infinitely  many  solutions,  vo  =  nn  for  any  integer  n. 

(a)  Determine  x\,X2. 

(b)  Plot  F(x,  2)  =  sin  x  +  s(x2  —  2x)  for  2  =  y^.  How  many  zeroes  does  it  have? 

(c)  Parts  (a,  b)  show  that  there  is  a  conflict,  as  s  — >  0  there  is  a  finite  number  of 
solutions,  namely  there  is  a  maximum  value  for  n ,  call  it  N(s),  at  each  value 
of  2.  Requiring  that  the  expansion  for  v  remain  asymptotically  ordered  (vo 
sx\  . . .)  suggests  one  estimate  for  N ,  but  show  that  maintaining  ordering  in 
the  expansion  of  the  equation,  given  vo  and  x\  from  (a),  yields  the  correct  N(s). 

6.9  In  the  1 930s,  Carleman  showed  that  in  an  appropriate  asymptotic  limit  a  diffusion 
model  could  be  derived  from  a  system  of  reactive  wave  equations.  Consider  the 
system  of  PDEs  for  p(x,  t),  q(x,  t)  with  s  >  0: 


2  dp  dp 
£  - - b  £ 


dt 


dx 


=  q  -  p. 


=  p-q. 


(a)  If  the  solutions  are  uniform  in  space  (i.e.p  =  p(t),q  =  q(t)),  then  p ,  q  satisfy  a 
reversible  transformation  reaction  P  ^  Q  (see  Sect.  1.2).  What  is  the  conserved 
quantity  for  this  reaction  system?  What  is  the  reaction  rate? 

(b)  If  the  reactions  (the  right-hand  side  terms)  are  eliminated,  find  the  travelling 
waves  for  the  reduced  equations  for  p(x,  t)  and  q(x,t).  What  are  the  speeds  of 
these  waves? 
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(c)  Define  u  =  p  +  q  and  v  =  (p  —  q)/s.  Combine  the  full  equations  for  p  and  q 
to  obtain  two  equations  for  u  and  v.  In  the  limit  s  — >  0,  use  regular  perturbation 
expansions  u(x,  t)  ~  uo  +  eu\  +  •  •  •  and  v(x,  t)  ~  vo  +  evi  +  •  •  •  to  find  the 
leading  order  equations  for  w  and  v,  and  then  derive  a  PDE  for  uq(x,  t)  alone. 

6.10  The  nondimensional  form  of  the  shallow  water  equations  is 

3  h  3  u  3  h  3  u  3  u  1  3  h 

—  H-  /z  —  T  W  —  =  0,  —  -f  W  —  H-  — ^  —  —  0, 

dt  dx  dx  3 1  dx  Fr2  dx 

where  u(x,  t)  is  the  fluid  speed,  h(x,  t)  is  the  height  of  the  fluid,  and  Fr  is  a  dimen¬ 

sionless  parameter.  A  uniform,  steady  solution  is  given  by  h  =  1  and  «  =  1,  which 
can  represent  a  river  having  uniform  speed  and  depth. 

Consider  waves  generated  due  to  some  small  disturbance  on  this  steady  state.  Fet 

h  =  1  +  £q(x,  t),  u  =  1  +  £v(v,  0- 

(a)  For  s  — >  0,  determine  the  (9(e)  linearised  wave  equations  for  q,  v. 

(b)  Apply  the  approach  of  Sect.  2.4  to  the  linearised  system  to  determine  the  critical 
value  of  the  Froude  number  at  which  waves  from  the  disturbance  change  from 
spreading  in  both  up-  and  down-stream  directions  to  having  all  ripples  being 
swept  downstream  (called  sub-  and  super- critical  behaviours). 


Chapter  7 

Boundary  Layer  Theory 


We  have  seen  in  Chap.  6  that  singularly  perturbed  problems  can  have  co-existing 
regular  and  singular  solutions  that  scale  differently  as  s  — >  0.  In  the  context  of 
physical  systems  described  by  differential  equations,  such  structures  yield  multi-scale 
phenomena.  Everyday  life  yields  countless  examples  of  multi-scale  phenomena: 
violent  winds  in  tornadoes  surrounded  by  relatively  calm  air  over  large  areas,  bands 
of  wake  behind  ships  moving  in  otherwise  still  waters,  cracks  forming  in  uniform 
solid  materials,  spots,  stripes  and  other  intricate  patterns  developing  in  biological 
systems.  In  these,  and  many  other  contexts,  we  can  separate  the  behaviour  of  the 
system  into  regions  of  rapid  variation  of  quantities  of  interest  compared  to  other  larger 
scale  regions  of  slow  variation.  Models  that  can  capture  such  diverse  behaviours  will 
allow  for  multiple  distinguished  limits,  describing  balances  between  different  sets  of 
dominant  effects  in  different  regions.  The  relatively  narrow  regions  of  rapid  variations 
are  generally  called  boundary  layers. 

Although  boundary  layers  were  originally  formulated  to  describe  problems  in 
fluid  mechanics  and  aerodynamics  [76]  (where  they  generally  occur  on  the  boundary 
of  a  solid  object  passing  through  a  surrounding  uniform  fluid  flow),  they  also  describe 
solutions  in  broader  sets  of  contexts. 

While  attempting  to  directly  find  a  solution  of  the  full  problem  on  an  entire  domain 
directly  may  be  very  difficult,  constructing  “partial  solutions”  on  different  regions 
using  perturbation  expansions  can  be  straightforward.  Following  the  approach  in¬ 
troduced  in  Sect.  6.5,  different  dominant  balances  will  re-scale  the  full  model  into 
different  forms,  leading  to  different  regular  or  singular  solutions.  Further  analysis  is 
then  needed  to  assemble  the  partial  solutions  into  a  complete  solution  of  the  full  prob¬ 
lem.  This  is  accomplished  through  asymptotic  matching  and  leads  to  this  solution 
methodology  being  called  the  method  of  matched  asymptotic  expansions. 
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7  Boundary  Layer  Theory 


7.1  Observing  Boundary  Layer  Structure  in  Solutions 


In  order  to  illustrate  the  main  principles  behind  boundary  layers  and  matched  asymp¬ 
totics,  we  first  consider  a  problem  for  which  we  can  express  the  solution  exactly  and 
examine  how  it  can  be  separated  into  pieces  stemming  from  behaviours  at  different 
scales. 

Consider  the  linear,  constant  coefficient  ordinary  differential  equation 


£ 


for 


on  the  domain  0  <  v  <  1,  subject  to  the  boundary  conditions 


y(0)=0,  y(l)  =  1. 


(7.1a) 


(7.1b) 


Singular  behaviour  in  the  solution  should  be  expected  since  setting  s  =  0  in  (7.1a) 
reduces  the  equation  to  a  first  order  ODE,  whose  solution  can  satisfy  only  one  of  the 
boundary  conditions. 

The  exact  solution  of  (7.1)  for  any  £  >  0  is  given  by 


(7.2) 


(see  Fig.  7.1).  If  £  is  small  (0  <  £  <<C  1),  we  can  expand  the  arguments  of  the 
exponentials  to  yield 


Fig.  7.1  (Left)  solution  (7.2)  to  problem  (7. 1)  for  s  =  0.1.  (Right)  the  behaviour  of  the  solution  in 
the  limit  e  ->  0 
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exp(-[i +  §  +  •••]*)  - exp(-[f  -  2 +  •••]*) 
exp(— [2  +  §  +  •••])-  exp(-[f  -  5  +  •  •  •  ]) 

exp  (-  jx)  -  exp  (-fx) 

exP  (— 1)  —  exP  (— f) 


(7.3) 


The  size  of  the  domain  is  independent  of  s,  and  hence  one  of  the  spatial  scales  should 
be  x  =  (9(1)  as  s  ->  0,  as  represented  by  the  e~x /2  term  in  (7.3).  The  other  term 
there  depends  on  a  more  rapidly  varying  spatial  scale,  x/s,  in  which  a  small  (0(e)) 
change  in  x  yields  a  O  (1)  change  in  the  solution.  The  distinctness  of  these  two  scales 
makes  it  possible  to  construct  the  solution  to  (7.1)  by  seeking  its  dependence  on  each 
scale  separately. 

Each  spatial  scale  in  the  problem  is  associated  with  a  limiting  process  for  the 
solution  of  (7.1)  as  e  — >  0.  Fixing  x  =  0(1)  in  the  range  0  <  x  <  1  gives 


lim  y(x)  ~ 


e  x /2  —  e.s.t. 
£-!/2  _  e.s.t. 


e(l~x)/2, 


(7.4) 


where  “e.s.t.”  refers  to  exponentially  small  terms ,  of  the  form  e~a for  a  >  0,  that 
are  smaller  than  all  algebraic  powers  of  s  (, sn  e~a's  for  s  ->  0)  and  are  treated  as 
negligible  in  this  context.  This  limiting  form  of  the  solution  satisfies  the  boundary 
condition  at  x  =  1,  but  not  the  one  at  x  =  0  (7.1b). 

Note  that  we  have  excluded  the  case  x  =  0  from  consideration  in  (7.4)  so  that 
e-2x/s  js  in(jee(i  exponentially  small.  If  instead,  we  consider  a  small  neighbourhood 
of  the  origin,  0  <  x  =  0(s),  then  we  must  take  the  dual  limit  s  ->  0  and  x  — >  0 
with  the  ratio  X  =  x/s  held  fixed.  In  terms  of  the  new  spatial  variable  X,  the  limit 
of  (7.3)  now  becomes 


p-eX/ 2  _  -2X 

limy-— m - 277  ~  «‘/2d  “  e  )•  (7-5> 

This  limiting  form  satisfies  the  boundary  condition  at  x  =  0  (X  =  0).  The  boundary 
condition  at  x  =  1  is  not  satisfied,  but  since  the  right  hand  boundary  position, 
corresponding  to  X  =  1/s,  violates  the  assumption  X  =  0(1)  made  in  taking  the 
limit,  agreement  should  not  have  been  expected. 

Similar  examples  are  often  used  in  analysis  [21]  to  illustrate  functions  that  have 
non-uniform  convergence.  In  the  present  context,  we  see  that  different  limiting  prop¬ 
erties  of  the  solution  are  captured  by  different  limiting  processes.  For  the  majority 
of  the  domain  (here,  where  x  =  0(1)),  the  solution  is  given  by  (7.4)  and  is  called 
the  outer  solution.  In  contrast,  in  the  boundary  layer ,  or  inner  domain ,  the  solution 
exhibits  singular  behaviour  that  cannot  be  captured  by  the  outer  solution.  In  this 
example,  the  inner  solution  in  the  boundary  layer  close  to  x  =  0  has  a  singular 
derivative,  dy/dx  =  0(s~l)  — >  ooas£  — >  0. 
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It  is  also  worth  mentioning  that  problems  featuring  a  separation  of  scales  can  be 
extremely  difficult  to  compute  numerically  (typically  referred  to  as  stiff  problems).  In 
contrast,  solutions  of  these  problems  can  often  be  very  accurately  obtained  in  terms 
of  inner  and  outer  solutions  using  perturbation  methods. 


7.2  Asymptotics  of  the  Outer  and  Inner  Solutions 

We  now  discuss  the  construction  of  solutions  to  singular  perturbation  problems  by 
calculating  perturbation  expansions  for  the  outer  and  inner  solutions. 

We  shall  continue  to  use  problem  (7.1a,  7.1b)  to  illustrate  the  methodology.  We 
begin  by  attempting  to  find  the  outer  solution  on  the  outer  domain ,  0  <  x  <  1,  in 
which  we  assume  that  y(v)  and  all  of  its  derivatives  are  bounded,  smooth  and  0(1). 
We  assume  y(x)  can  be  expanded  as  a  regular  perturbation  expansion  of  the  form 

y(x)  ~  yo(x)  +  syi(x)  +  s2y2(x)  H -  as  s  -*  0.  (7.6) 

Similar  to  the  approach  used  for  (6.24),  we  substitute  this  expansion  into  (7.1a,  7.1b) 
and  separate  terms  in  powers  of  s  — >  0  yielding  the  system  of  sub-problems 


O(s0)  : 

+  Jo  =  0 

jo(l)  =  l, 

0(sl)  : 

2  y[  +yi  =  -Jo 

Ji(D  =  0, 

(7.7) 

0(s2)  : 

2j2  +  J2  =  -y'{ 

J2d)  =0, 

and  so  on  for  higher  powers  of  s.  Note  that  the  boundary  condition  at  v  =  0  from 
(7.1b)  is  not  included  above  since  v  =  0  is  not  within  the  outer  domain.  Solving  the 
0(1)  problem  gives  the  leading  order  outer  solution 

jo«  =  e(1-*)/2,  (7.8) 


which  reproduces  (7.4).  Additional  terms  in  the  expansion  can  be  constructed  by 
working  through  the  higher  order  sub-problems  in  (7.7)  in  sequence. 

We  now  turn  our  attention  to  constructing  the  solution  in  the  inner  region ,  cor¬ 
responding  to  having  v  =  0(s).  Introducing  the  rescaling  x  =  sX  and  writing 
y(v)  =  k(X)  transforms  (7.1a)  to 


d2Y 

dx2 


dY 

+  2 - +sY 

dX 


(7.9) 


with  the  boundary  condition  (7.1b)  becoming  Y (0)  =  0.  Seeking  the  solution  in  the 
form  of  a  regular  expansion,  Y (X)  =  Yo(X)  +  sY\  +  s2Y2(X)  +  •  •  • ,  yields  the 
leading  order  0(1)  problem 


Yq  +  2Fq  =  0,  F0(0)  =  0, 


(7.10) 
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with  solution 

Y0(X)  =  A(l-e~2X).  (7.11) 

Note  that  one  constant,  A,  remains  undetermined  in  the  solution  since  the  ODE  in 
(7.10)  is  second  order,  but  we  are  only  imposing  one  side  condition.  The  solutions 
of  the  higher  order  terms,  Yn(X)  will  similarly  add  one  new  undetermined  constant 
at  each  order  in  the  expansion.  Since  the  inner  problem  did  not  uniquely  define  the 
inner  solution,  we  examine  its  relationship  with  the  outer  solution  in  an  attempt  to 
fix  the  unknown  constant  A. 

It  is  important  to  note  that  the  inner  and  outer  domains  are  not  mutually  exclusive 
and  are  valid  on  regions  broader  than  the  strict  prescriptions  of  their  spatial  scales 
(here  v  =  0(s )  and  v  =  0(1)  respectively).  In  the  outer  solution  v  is  bounded 
away  from  zero,  but  can  become  small,  say  v  =  O(^fs)  — >  0.  Likewise,  in  the  inner 
solution  v  should  be  small,  but  X  =  x/e  can  become  large,  say  X  =  0(1  /  */e)  — >  oo. 

In  fact,  it  can  be  shown  that  there  is  an  overlap  domain , 

overlap  domain:  e  «  x  «  1,  (7.12) 

between  the  scales  set  by  the  inner  and  outer  domains,  where  both  inner  and  outer 
solution  are  valid  (see  Fig.  7.2),  Loosely  speaking,  there  is  a  range  where  x  is  small 
(small  enough  for  the  inner  solution  to  apply),  but  not  too  small  (where  the  outer 
solution  would  not  apply).  Since  the  original  full  problem  (7.1a,  7.1b)  has  a  unique 
solution;  if  the  inner  and  outer  solutions  are  both  valid  in  the  overlap  domain,  they 
cannot  be  two  distinct  solutions  and  must  in  fact  be  two  different  asymptotic  repre¬ 
sentations  of  the  same  solution.  This  relation  is  expressed  in  terms  of  limits  derived 
from  (7.12):  (i)  x  <<C  1,  the  outer  variable  must  approach  the  inner  domain,  x  — >  0 
and  (ii)  e  <  v  (or  after  dividing  across  by  s:  1  <<C  X),  the  inner  variable  must 
approach  the  outer  domain,  X  ->  oo.  The  resulting  limit  requirement 

lim  To (2Q  =  lim  yoOO,  (7.13) 

X^oo  x— >0 

is  called  the  leading  order  asymptotic  matching  condition  [60,  101].  This  principle 
can  be  paraphrased  as 

Fig.  7.2  A  schematic 
representation  of  the  inner, 
outer  and  overlap  domains 
for  problem  (7.1a,  7.1b) 


Overlap 


Outer 


0 


x 


1 


152 


7  Boundary  Layer  Theory 


“The  outer  limit  of  the  inner  solution  equals 

the  inner  limit  of  the  outer  solution.  ”  (7.14) 

Applying  (7.13)  to  Fo  given  by  (7.11)  and  yo  by  (7.8)  yields 

lim  A(  1  -  e~2X)  =  A  =  lim  e(l~x)/2  =  el/2,  (7.15) 

X^oo  x  0 

thereby  determining  the  constant  A  =  e 1/2.  With  this  value  for  A  the  inner  solution 
(7.11)  reproduces  the  limit  (7.5)  found  from  the  exact  solution. 

While  we  have  now  determined  the  leading  order  inner  and  outer  solutions  com¬ 
pletely,  only  a  little  more  work  is  needed  combine  the  outer  and  inner  solutions 
to  form  a  composite  representation  of  the  leading  order  solution,  denoted  here  by 
Tcomp(v),  valid  over  the  entire  domain  0  <  v  <  1.  An  appropriate  form  for  yComp(v) 
is  given  by  the  expression 


Tcomp(v)  =  yo  +  Yo  -  (overlap  from  matching),  (7.16) 

where  the  overlap  is  simply  the  contribution  found  through  matching  in  (7.15).  At  a 
formal  level,  on  most  of  the  domain,  we  have  y  ~  yo,  with  the  inner  solution  only 
becoming  significant  in  the  inner  domain.  Where  lb  becomes  important,  we  gain 
equal  contributions  from  both  the  inner  and  outer  solutions,  and  so  to  effectively 
prevent  “double-counting”,  we  must  subtract  off  the  overlap. 

Another  way  of  expressing  (7.16)  is  to  write 


Tcomp  —  TO  H"  1bLC> 


(7.17) 


where  the  boundary  layer  correction ,  Fblc>  is  the  adjustment  to  the  outer  solution 
made  by  the  boundary  layer  to  satisfy  the  boundary  condition,  with 


1b lc  =  lo  —  (overlap  from  matching), 


(7.18) 


where  we  expect  Fblc  — >  0  as  X  — >  oo. 

Writing  the  inner  solution  (7.11)  as  Fo  =  —  £(1-4x/£V2  an(j  using  the  overlap 

from  (7.15),  the  leading  order  boundary  layer  correction  is 


Iblc  =  e 


-  .1/2  _  „(l-4*/£0/2  _  el/2  =  _e(l-4x/s)/2 


(7.19) 


which  does  indeed  vanish  as  X  — >  oo.  Hence,  (7.17)  gives  the  leading  order  solution 


p-xf2  _  p-2x/e 

v  -  p<\-x)/2  _  p{\-4x/e)/2  _  L  1 

ycomp  —  ^  e  —  ^-1/2 


(7.20) 
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on  the  entire  domain  0  <  v  <  1 .  This  is  comparable  to  the  exact  solution  (7.2),  except 
for  the  absence  of  the  exponentially  small  e~2!£  term  (which  cannot  be  captured  in 
regular  expansions  such  as  (7.6)). 


7.3  Constructing  Boundary  Layer  Solutions 

The  presentation  in  the  previous  section  made  use  of  some  prior  knowledge  of  the 
form  of  the  solution  to  reduce  the  overall  problem  to  that  of  determining  the  ex¬ 
pansions  of  the  inner  and  outer  solutions.  In  this  section,  we  describe  the  further 
steps  required  to  investigate  boundary  layer  problems  without  any  additional  given 
information  about  the  form  of  the  solution. 

The  full  process  of  constructing  a  solution  involving  matched  asymptotic  expan¬ 
sions  requires  examining  the  two  additional  questions: 

•  What  are  the  scalings  for  the  inner  (and  outer)  solution(s)? 

•  Where  are  the  boundary  layer(s)  located? 

The  answers  to  these  questions  ultimately  determine  the  form  of  the  overall  solution, 
and  control  which  boundary  conditions  apply  to  the  inner  and  outer  solutions. 

For  many  singularly  perturbed  differential  equations,  solutions  can  be  constructed 
by  a  step-by-step  process: 

(1)  The  outer  solution :  If  the  problem  is  in  standard  form,  try  a  regular  expansion, 

y  (x)  =  yoOO  +  sy  l  (x)  H - ,  for  the  outer  solution.  If  all  of  the  boundary  con¬ 

ditions  can  be  satisfied  by  this  solution,  then  the  problem  is  complete;  otherwise, 
inner  regions  will  be  necessary. 

(2)  Find  the  dominant  balances :  The  appropriate  forms  for  all  of  the  regular  (outer) 
and  singular  (inner)  solutions  of  the  ODE  will  be  determined  by  the  distinguished 
limits  of  the  problem.  In  general,  both  the  independent  and  dependent  variables 
may  need  to  be  scaled  to  obtain  all  of  the  dominant  balances, 

y  =  s^Y(X),  X  =X~X*  <S>  x=x*+e“X,  (7.21) 

£a 

where  the  powers  a,  /3  and  the  assumed  position  of  the  boundary  layer,  v*  must  all 
be  determined  (there  may  be  multiple  valid  locations  for  x*).  The  outer  solution 
in  step  (1)  assumes  a  =  0,  /3  =  0. 

(3)  The  inner  solution :  For  the  singular  distinguished  limit,  write  the  problem  as  a 
rescaled  regular  problem  and  seek  the  solution  in  the  form  of  an  appropriate 
regular  expansion,  Y(X)  =  Yo(X)  +  sY\(x)  +  •  •  • . 

(4)  Asymptotic  matching :  Apply  asymptotic  matching  between  the  inner/outer  solu¬ 
tions  (typically  via  (7.1 3))  to  confirm  the  consistency  of  the  asymptotic  expansion 
and  determine  any  remaining  unknown  parameters  in  the  solution. 

(5)  The  composite  solution :  Writing  the  outer  and  inner  solutions  in  terms  of  the 
original  variables  and  subtracting  the  overlaps  from  the  matching  process  to 
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prevent  “double-counting”  will  produce  the  leading  order  solution  on  the  entire 
domain  (7.16).  So,  in  an  example  with  boundary  layers  at  both  the  left  and  right 
boundaries,  we  write 


VcompC-O  ~  JOU)  +  ^blc  +  Y£lc.  (7.22) 

This  description  covers  broad  classes  of  problems,  but  as  will  be  seen,  in  some  cases, 
steps  (2,  3,  4)  may  become  a  bit  intertwined.  We  also  note: 

•  When  boundary  layers  are  necessary,  which  boundary  conditions  apply  to  the 
outer  solution  may  not  be  immediately  apparent.  Hence  a  general  form  for  the 
outer  solution  will  be  needed  initially. 

•  The  boundary  conditions  as  well  as  the  ODE  play  a  role  in  determining  the  dom¬ 
inant  balances. 

•  The  location  of  the  boundary  layer  and  which  boundary  conditions  apply  to  the 
inner  solution  might  not  be  determined  until  matching  is  applied. 

•  If  the  inner/outer  solutions  are  not  matchable  (either  limit  does  not  exist,  or  equation 
(7.13)  cannot  be  satisfied)  then  the  assumed  choice  of  boundary  layer  position  v* 
or  dominant  balance  may  be  not  be  right. 

•  While  the  term  “boundary  layer”  stems  from  the  fact  that  the  inner  domain  often 
occurs  at  a  boundary,  in  some  cases,  they  can  also  occur  within  the  domain  of  a 
problem,  in  which  case  they  are  sometimes  called  interior  layers. 

Consequently,  the  reader  should  consider  steps  (l)-(5)  as  “guidelines”  that  may  need 
to  be  adjusted  depending  on  the  given  problem;  this  is  one  of  the  challenging  (and 
interesting)  points  of  matched  asymptotic  expansions. 

We  use  an  example  to  illustrate  the  aspects  of  the  above  procedure.  Consider  the 
boundary  value  problem 

d2y  dy 

s — y  H - =  cosv  (7.23a) 

dxz  dx 

in  the  limit  s  — >  0  on  the  domain  0  <  x  <  tv,  subject  to  the  boundary  conditions 

y(0)  =  2,  y(jt)  =  — 1.  (7.23b) 


7.3.1  The  Outer  Solution 

Assuming  that  y(x)  and  its  derivatives  are  bounded  as  s  ->  0,  we  write  the  outer 
solution  as  y  ~  yoOO  +  syiCO  +  £2yiM  •  •  • .  Substituting  into  (7.23a)  gives  the 
sequence  of  equations 

y'0  =  cos  x , 


0(e°) 
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and  so  on  for  higher  order  equations.  The  0(1)  problem  yields  yo  =  sin  v  +  A  and 
substituting  this  into  the  O(s)  equation  gives  yi (v)  =  —  cos  x  +  B.  We  can  proceed 
in  this  way  to  determine  as  many  terms  as  desired  in  the  expansion  of  the  general 
outer  solution 

yout  =  (sin  v  +  A)  +  e(—  cos  x  +  B)  +  0(s2).  (7.24) 

At  each  order,  there  is  only  a  single  constant  of  integration,  A,  B,  ... .  Imposing 
the  condition  at  v  =  0  from  (7.23b)  selects  A  =  2,  while  the  condition  at  v  =  it 
picks  A  =  —  1 ;  the  outer  solution  cannot  satisfy  both  at  once,  and  hence  a  boundary 
layer  will  be  required.  In  summary,  at  this  point,  we  do  not  know  which  boundary 
conditions  will  apply  to  the  outer  and  which  to  the  inner  solutions. 


7.3.2  The  Distinguished  Limits 

To  determine  the  relevant  scaling  of  the  singular  solution,  we  write  y(v)  =  s^Y(X) 
and  X  =  (x  —  x*)/sa  and  assume  that  Y(X)  =  0(1)  for  X  =  0(1).  Substituting 
into  (7.23a)  yields 


fil— 2a+fiY"  +  e~  a+f>Y'  =  cos(x#  +  sa  X),  (7.25) 

(1)  (2)  (3) 

where  0  <  v*  <  n.  We  also  note  that  both  boundary  conditions  (7.23b)  take  the 
form  s^Y  =  0(1),  and  hence  any  solution  local  to  a  boundary  must  have  =  0.  It 
remains  to  determine  a  from  the  possible  dominant  balances: 

(a)  Terms  (2,  3):  s~a  =  £°  =>  a  =  0, 

(b)  Terms  (1,  3):  sl~2oc  =  s°  =>-  a  =  1/2, 

(c)  Terms  (1,  2):  sl~2a  =  s~a  =>-  a  =  1. 


Option  (a)  is  the  regular  distinguished  limit  that  corresponds  to  the  outer  solution. 
Option  (b)  is  not  a  valid  balance  since  the  neglected  term  (2)  is  not  sub-dominant, 
0(£-1/2)  0(1).  Consequently,  the  boundary  layer  must  take  the  form  given 

by  (c)  where  the  neglected  term  (3)  is  sub-dominant  to  the  leading  balance  with 
0(1)  <  0(s~l).  We  note  that  in  some  problems,  the  dominant  balances  can  change 
for  different  assumed  positions  of  the  boundary  layer,  v*  (most  notably  for  non- 
autonomous  equations),  but  here  term  (3)  uniformly  satisfies  |  cos(v)  |  <  1  =  0(e°). 
Hence  our  scaled  equation  for  the  inner  solution  is  given  by 


d2Y  dY 

^  +  —  =  «°s(,.  +  £X), 


(7.26) 


where  v*  has  not  yet  been  determined. 
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7.3.3  The  Inner  Solution 


Having  the  inner  problem  in  regular  perturbation  form,  we  expand  Y  (X)  as  Y  ~ 
Fq  +  sY\  +  s2Y2  +  •  •  •  and  substitute  into  (7.26)  to  give  the  system  of  equations 


O(e0) 
0(sl) 
0(s 2) 


Yo +n=°’ 

Y\  +  Y'\  =  cos(x*), 

Y 2  +  ^2  =  -  sin(x*)  V  •  •  •  • 


The  0(1)  equation  yields  the  leading  order  inner  solution, 

Y0(X)  =  D  +  Ce~x , 


(7.27) 


(7.28) 


with  the  higher  order  problems  producing  smaller  corrections  to  this  result.  The 
constants  of  integration  C,  D  must  be  determined  by  boundary  conditions  or  by 
matching  with  the  outer  solution,  but  this,  in  turn,  depends  on  the  location  of  x*. 

Consider  the  forms  of  the  inner  domain  in  terms  of  X  =  (x  —  x*)/e  for  different 
possible  values  of  v* 

(i)  Left  boundary  (v*  =  0)  :  x  >  0  =>-  0  <  X  <  o(l/e) 

(ii)  Interior  :  0  <  v*  <  it  =>  —o(  1/e)  <  X  <  o(l /e) 

(iii)  Right  boundary  (x*  =  tv)  :  x  <  n  =>  —o(l/s)  <  X  <  0. 

These  options  correspond  to  three  possible  forms  of  the  composite  solution  (see 

Fig.  7.3):  (a)  a  left  boundary  layer  satisfying  y(0)  =  2  matching  with  an  outer 
solution  which  has  A  =  1  in  order  to  satisfy  the  right  boundary  condition,  (b)  a  nar¬ 
row  interior  transition  region  connecting  two  outer  solutions,  with  AL  =  2  and 
Ar  =  —1,  and  (c)  a  right  boundary  layer  satisfying  y(Tt)  =  —1,  with  an  outer 
solution  satisfying  y(0)  =  2. 

The  location  of  the  boundary  layer  will  be  determined  by  the  structure  of  (7.28) 
and  its  limiting  behaviour.  The  exponential  term  e~x  in  (7.28)  diverges  if  X  is  allowed 
to  become  large  and  negative.  Such  exponentially  diverging  terms  cannot  satisfy  the 


(a)  (b)  (c) 


0  71  0  7T  0  77 

Fig.  7.3  Three  hypothetical  sketches  of  the  conjectured  inner/outer  solutions  for  (7.23a,  7.23b) 
with  a  boundary  layer  a  at  the  (Left),  b  in  the  (Interior),  c  at  the  (Right)  edge  of  the  domain 
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asymptotic  matching  condition  (7.13)  (with  X  — >  —  oo  being  the  appropriate  form 
of  the  ‘outer  limit’  process)  and  are  un-matchable . 

Consequently  options  (ii)  and  (iii)  are  not  feasible,  and  we  conclude  that  the 
boundary  layer  must  be  at  v*  =0  with  the  left  boundary  condition  from  (7.23b) 
being  relevant,  namely  Y (0)  =  2.  Applying  this  condition  reduces  (7.28)  to 

Yo(X)  =  2  +  C(e~x  -l),  (7.29) 

where  C  remains  to  be  determined. 


7.3.4  Asymptotic  Matching 

Having  identified  the  position  of  the  boundary  layer  as  v*  =  0,  we  have  the  leading 
order  inner  solution  (7.29),  valid  on  0  <  x  <  0(s),  and  the  outer  solution  (7.24), 
valid  on  0  <  x  <  n .  Since  the  right  boundary  lies  in  the  outer  domain,  that  boundary 
condition  determines  A  =  —  1  in  (7.24),  leaving  C  from  (7.29)  as  the  last  remaining 
unknown. 

Applying  the  matching  condition  (7.13)  for  Yq(X  — >  oo)  and  yo(x  — >  0)  yields 
lim  2  +  C{e~x  -  1)  =  2  -  C  =  lim  sin(x)  -  1  =  -1  (7.30) 

X^oo  x  — >  0 

and  hence  C  =  3. 


7.3.5  The  Composite  Solution 

The  overlap  shared  in  common  by  the  leading  order  inner  and  outer  solutions  above 
is  —  1 .  Therefore  we  can  form  the  boundary  layer  correction  as 

*blc  =  Y0  -  (-1)  =  3e"x. 

Finally,  adding  this  correction  to  the  outer  solution  yields  the  leading  order  composite 
solution  on  0  <  x  <  n  (see  Fig.  7.4), 


ycomp  =  -1  +  sin(x)  +  3e  x/e 


(7.31) 


This  is  in  agreement  with  the  exact  solution  (valid  for  all  s  >  0), 
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Fig.  7.4  A  plot  of  ycomp(T) 
(7.31)  for  a  sequence  of 
e  ->  0.  The  boundary  layer 
becomes  narrower  as  e 
decreases 


7.4  Further  Examples 

We  present  further  examples  to  illustrate  other  aspects  of  the  method  of  matched 
asymptotic  expansions. 

Consider  the  problem  of  obtaining  the  leading  order  solution  to  the  ODE  problem 
on  0  <  x  <  1 , 

d2y  2 

s — 7T  —  (2  —  x2)y  =  —  1  for  s  — >  0,  (7.32a) 

dxz 

with  boundary  conditions 


/(0)  =  0,  y(l)  =  0.  (7.32b) 

We  begin  by  seeking  the  outer  solution  as  a  regular  perturbation  expansion,  y(x)  ~ 
TO  +  syi(x)  +  £2T2  +  •  •  •  •  The  equation  for  the  leading  order  term  is 

—  (2  —  x2)yo  —  —  1  ==>  yo(x)--d—~.  (7.33) 

2  —  xz 

The  leading  order  outer  solution  satisfies  the  /(0)  =  0  boundary  condition,  i.e. 
Tq(0)  =  0,  hence  no  boundary  layer  is  needed  there. 

The  outer  solution  has  no  free  parameters,  and  it  does  not  satisfy  the  boundary 
condition  at  x  =  1 .  Therefore  there  must  be  a  boundary  layer  at  v*  =  1 . 

So  we  seek  a  singular  solution  in  the  form  y(x)  =  s^Y(X)  withX  =  (x  —  \)/£afor 
X  <  0.  Unlike  the  inhomogeneous  conditions  in  (7.23b),  the  homogeneous  boundary 
condition  y(l)  =  0  does  not  provide  us  information  on  /3  since  s^Y( 0)  =  £^0  =  0 
for  any  /3.  However,  if  we  can  determine  /3  using  some  alternative  means,  it  will 
simplify  the  process  of  finding  the  distinguished  limit  for  the  inner  solution.  Turning 
to  the  asymptotic  matching  condition  provides  help;  here  this  condition  will  take  the 
form 

lim  yoOO  =  1™  £^To(X). 

X^-oo 


(7.34) 
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While  we  have  not  determined  Fo(20>  it  is  assumed  to  be  0(1),  and  for  v  — >  1,  we 
have  yo(l)  =  1.  Since  the  limit  of  the  outer  solution  is  0(1),  so  must  be  the  limit  of 
the  inner  solution,  hence  /3  =  0. 

Substituting  y(x)  =  Y(X)  and  x  =  1  +  £aX  into  (7.32a)  yields 

e1-2 aY"  -  (2  -  (1  +  saX)2)Y  =  -1 

and  in  final  form: 

£i-2ory'/_(1  _  2eaX  -  e2aX2)Y  =  — 1  .  (7.35) 

(1)  (2)  (3) 

On  a  finite  domain,  a  >  0  with  a  >  0  describing  boundary  layers  that  narrow  like 
O  (s01)  as  £  0.  We  note  that  for  a  >  0  the  sum  of  terms  in  the  parentheses  in  term 

(2)  have  leading  order  term  (2  —  x2)  ~  1,  so  the  higher  order  terms  there  cannot 
contribute  to  a  consistent  dominant  balance. 

Balancing  (2,  3)  yields  the  distinguished  limit  for  the  outer  solution,  a  =  0, 
with  term  (1)  being  sub-dominant,  0(£)  <<C  0(1).  The  other  distinguished  limit  is 
a  =  1/2,  which  balances  all  three  terms  at  0(1). 

We  now  have  the  form  of  the  inner  problem  as 

Y"  —  (1  —  2e1/2X  —  sX2)Y  =  — 1,  y(0)  =  0,  (7.36) 

for  X  <  0.  The  presence  of  the  el/2  suggests  the  expansion  of  the  solution  should 
take  the  form  Y  (X)  ~  Yq  +  e 1  /2  Y i  +  s  Y2  +  •  •  • .  The  leading  order  ODE  is 

Yq  —  Yq  =  —l,  (7.37) 

with  solution 

Y0(X)  =  Ae~x  +  Bex  +  l.  (7.38) 

For  this  solution  to  be  matchable  to  the  outer  solution  as  X  — >  —  oo,  we  must  take 
A  =  0.  Applying  the  boundary  condition,  Yo(0)  =  B  +  1  =  0  then  gives  us  the 
solution, 

Yq(X)  =  l-  ex.  (7.39) 


Since  neither  the  leading  order  inner  or  outer  solutions  have  any  undetermined  con¬ 
stants  they  should  match  automatically.  This  is  indeed  the  case,  and  (7.34)  applied 
to  (7.33)  and  (7.39)  shows  they  match  with  an  overlap  limit  of  1.  Hence  we  can 
construct  the  leading  order  composite  solution  (see  Fig.  7.5) 


1 

2  —  x2 


—  e 


Jcomp 


O-l  )/yfe 


(7.40) 
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Fig.  7.5  Plot  of  (7.40)  for 
e  —  10_n  with  n  =  2,  3,  4,  5 


Getting  a  solution  accurate  to  higher  orders  would  involve  obtaining  further  terms  in 
the  expansions  of  the  inner  and  outer  solutions.  Despite  the  fact  that  the  expansions 
have  different  gauge  function  ( sn  vs.  em /2)  the  solutions  must  match  together.  One 
approach  for  performing  matching  to  higher  order  is  given  in  Exercise  7.6. 

We  now  make  one  change  to  (7.32a)  and  illustrate  how  dramatically  the  structure 
of  the  solution  is  affected;  consider  the  ODE  problem  on  0  <  v  <  1, 

s — ^  —  (1  —  x2)y  =  —1  for  s  0,  (7.41a) 

dxz 

with  boundary  conditions 

/(0)  =  0,  y(l)  =  0.  (7.41b) 

The  only  change  from  the  previous  example  is  that  the  coefficient  (2  —  x2)  in  (7.32a) 
has  been  replaced  by  (1  —  x2). 

As  before,  the  outer  solution  can  be  expressed  as  a  regular  expansion  and  the 
leading  order  solution  is  given  by  an  algebraic  equation, 

yo  =  r^,  (7.42) 

which  blows  up  as  x  — >  1  and  does  not  satisfy  the  boundary  condition  y(l)  =  0. 
We  again  conclude  that  there  must  be  a  boundary  layer  at  v*  =  1 ,  but  now  face  the 
problem  of  determining  a  boundary  layer  solution  that  can  match  to  a  diverging  outer 
solution. 

We  seek  an  inner  solution  in  the  scaled  form  y(x)  =  s^Y{X)  withX  =  (x-l)/ea, 
where  we  expect  p  <  0  to  capture  the  singular  nature  of  the  magnitude  of  the  solution 
and  a  >  0  for  a  narrow  boundary  layer.  Substituting  into  (7.41a)  yields 

gi-2  a+fiY"  +  ea+fs  X  (2  + eaX)Y  = 

(D  (2) 


(3) 


(7.43) 


7.4  Further  Examples 


161 


We  now  consider  the  options  for  two-term  dominant  balances  in  this  equation: 

(a)  Terms  (1,  2)  balance  if  1  —  2a  +  /3  =  a  +  f$,  namely  a  =  1/3.  To  ensure 

that  the  balance  is  consistent,  and  these  terms  are  larger  than  term  (3),  we  need 
a  +  <0,  i.e.  /3  <  —1/3. 

(b)  Terms  (1,  3)  balance  if  1  —  2a  +  =  0,  yielding  /3  =  2a  —  1.  Term  (2)  is  sub¬ 

dominant  if  a  +  ft  >  0.  Consequently,  this  and  the  condition  <  0  determine 
the  range  1  /3  <  a  <  1/2. 

(c)  Terms  (2,  3)  balance  if  a  +  =  0,  hence  =  —a.  Term  (1)  is  sub-dominant 

when  1  —  2a  +  /3  >  0,  yielding  0  <  a  <  1/3. 

It  can  be  useful  to  visualise  these  relations  in  the  (a,  fi)  parameter  plane  in  what  is 
called  a  Newton-Kruskal  diagram  [105],  see  Fig.  7.6. 

The  leading  order  equations  proposed  by  each  of  the  above  respective  cases  are: 

Yq  +  2XY0  =  0,  (7.44a) 

Yq  =  —  1,  (7.44b) 

2  XY0  =  -1.  (7.44c) 

Each  of  these  equations  can  be  shown  to  have  some  deficiency  in  trying  to  describe 
the  inner  solution.  The  solution  of  (7.44c),  To  =  —  1  /(2X),  cannot  satisfy  the  bound¬ 
ary  condition  at  X  =  0.  The  solution  of  (7.44b)  is  a  parabola  that  cannot  satisfy  the 
asymptotic  matching  condition  for  X  — >►  —  oo.  Equation  (7.44a)  is  less  straightfor¬ 
ward;  it  is  a  version  of  Airy’s  differential  equation  [11,  105]  but  it  can  likewise  be 
shown  that  its  solutions  also  cannot  satisfy  the  matching  condition  (7.34). 

The  above  balances  are  self-consistent,  but  because  they  are  not  the  most  general 
dominant  balance,  they  actually  have  a  limited  range  of  validity  with  the  inner  domain 
(it  can  be  shown  that  (a)  holds  for  X  =  0(1),  (b)  holds  for  X  — >  0,  (c)  holds  for 
X  — >  —  oo).  The  distinguished  limit  for  inner  problem  is  given  by  the  intersection 
of  the  three  cases,  a  =  1/3,  /3  =  — 1/3,  with  all  three  terms  in  (7.43)  balancing, 

Yq  +  2X  T0  =  —  1 ,  T0(0)  =  0.  (7.45) 


Fig.  7.6  Newton-Kruskal 
diagram  for  (7.43)  showing 
possible  two-term  dominant 
balances  for  cases  (a,  b,  c )  as 
line  segments 
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Fig.  7.7  Numerical 
solutions  of  (7.41a,  7.41b) 
for  a  sequence  of  sn  ->  0 
0 colour  curves )  and  the  outer 
solution  (7.42)  ( black  curve ) 


X 


Another  way  to  come  to  this  choice  of  scalings  is  to  use  the  matching  condition 
(7.34)  with  the  outer  solution  (7.42)  written  in  terms  of  X  as, 


TO 


1 

1  -  (1  +£aX)2  ~ 


1 

2s*  X 


(7.46) 


which  we  can  recognise  as  case  (c)  above  (for  X  — >►  —  oo)  with  /3  =  —a.  This  would 
reduce  (7.43)  to  an  equation  for  a,  having  two  distinguished  limits:  a  =  0  (the  outer 
solution),  and  the  boundary  layer  given  by  a  =  1/3. 

Constructing  the  composite  solution  would  require  solving  (7.45)  (it  is  an  inho¬ 
mogeneous  version  of  Airy’s  equation)  and  carrying  out  the  matching  from  (7.46) 
using  the  approach  of  Exercise  7.6.  Instead,  in  Fig.  7.7  we  show  that  the  numerical 
solution  of  the  full  problem  is  well  characterised  by  the  outer  solution  on  most  of 
the  domain  with  the  maximum  value  of  the  solution  and  the  width  of  the  boundary 
layer  being  well-predicted  by  the  scaling  of  the  inner  solution. 


7.5  Further  Directions 

The  problems  we  have  considered  above  provide  some  insight  into  how  boundary 
layers  and  matched  asymptotic  expansions  can  separate  out  some  of  the  delicate 
behaviours  of  solutions  of  singularly  perturbed  problems.  The  examples  have  shown 
that  while  the  steps  outlined  in  Sect.  7.4  are  a  good  guide,  they  may  be  coupled  to 
each  other  in  different  ways  in  each  problem — for  example:  are  the  scalings  of  the 
inner  solution  determined  by  the  ODE,  the  boundary  conditions,  or  by  matching?  is 
matching  needed  to  set  undetermined  coefficients  in  the  inner  or  outer  solution  (or 
neither  or  both)?  Many  problems  require  creative  application  of  these  steps.  Analysis 
of  some  more  challenging  problems  remain  open  research  problems. 

The  form  of  the  solutions  obtained  to  such  problem  generally  provides  greater 
insight  into  the  nature  of  the  system.  The  dominant  balance  that  determines  the 
leading  outer  solution  provides  the  simplest  essential  approximation  of  the  behaviour 
in  the  problem.  This  approximation  will  be  valid  everywhere  apart  from  the  boundary 
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layers.  Matched  asymptotics  provides  understanding  of  whether  the  boundary  layers 
are  necessary  to  pin-down  properties  of  the  outer  solution  or  merely  correct  the 
solution  in  narrow  regions.  Often,  knowledge  of  the  physical  system  being  modelled 
can  guide  expectations  on  boundary  layer  positions  and  dominant  balances;  this  can 
sometimes  simplify  the  mathematical  steps.  Sometimes,  the  matched  asymptotics 
will  uncover  unexpected  dominant  balances  that  can  highlight  novel  and  important 
behaviours  in  the  problem. 

This  chapter  only  hints  at  the  broad  array  of  models  that  can  be  studied  using 
matched  asymptotics  and  the  types  of  behaviours  that  can  result.  Some  of  these 
include  boundary  layers  within  boundary  layers  (nested  layers  or  “triple  decks”), 
boundary  layers  that  begin  at  higher  orders  (“corner  layers”),  and  problems  with 
unusual  gauge  functions.  There  is  a  wide  array  of  books  that  give  further  studies  of 
boundary  layer  problems  [47, 48,  58, 78,  92]  and  some  primary  sources  on  the  theory 
of  asymptotic  matching  are  [60,  101]. 


7.6  Exercises 

7.1  Evaluate  lim^o  e~l^s /sn  to  demonstrate  that  exponentially  small  terms  are 
smaller  than  all  algebraic  terms. 

7.2  Determine  the  three  possible  dominant  balances  for  (7.25)  that  could  occur  for 
x*  =  7r/2,  noting  that  cos(v)  ->  0  as  v  — >  v*. 

7.3  Consider  the  problem  for  y(v)  on  0  <  v  <  1  with  s  — >  0, 

B  77  —  (4  —  x2)y  =  cos  (§*)  >-(())  =  -i  y(  i)  =  2 

(a)  Determine  the  two  distinguished  limits  for  this  problem. 

(b)  Write  the  leading  order  outer  solution  yoOO- 

(c)  Find  the  leading  order  inner  solutions  To  (20- 

(d)  Write  the  leading  order  uniformly- valid  solution. 

7.4  Consider  the  initial  value  problem  for  y(v)  on  0  <  v  for  s  ->  0, 

d2y  dy  ,  4 

+2 /  -  6y  =  5jc,  y(0)  =  0,  /(0)  = 

dxz  dx  sz 

You  are  given  that  the  solution  has  a  boundary  layer  at  v*  =  0. 

(a)  Determine  the  leading  order  inner  solution. 

(b)  Write  the  outer  limit  of  the  inner  solution  to  determine  a  necessary  matching 
condition  on  the  outer  solution. 

(c)  Determine  the  leading  order  outer  solution  for  v  >  0. 
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7.5  Consider  the  problem  for  y(x)  on  0  <  v  <  1  with  s  ->  0, 

d2y  dy 

8-^  +  2/  +  ^  =  0,  y(0)  =  0,  y(l)  =  0. 

dxz  dx 

(a)  Find  the  general  leading  order  outer  solution  yoOO- 

(b)  Find  the  leading  order  inner  solution  lo  (20  and  determine  where  the  boundary 
layer  occurs. 

(c)  Write  the  leading  order  composite  solution. 

(d)  Determine  the  next  term  in  the  outer  solution,  y\(x). 

7.6  Consider  the  problem  for  y(x)  on  0  <  x  <  1  with  s  — >  0, 

d2y  dy 

/  +  T  =  2jc,  y(0)  =  -2,  y(l)  =  1. 
dxz  dx 

(a)  Determine  the  distinguished  limit  for  the  inner  problem  for  this  equation.  By 
solving  the  leading  order  inner  problem,  determine  where  the  boundary  layer 
occurs. 

(b)  Obtain  the  first  two  terms  in  the  expansion  of  the  inner  solution  (with  the  appro¬ 
priate  boundary  conditions  imposed), 

Y(X)^Yo(X)^8Yl(X). 

(c)  Determine  the  first  two  terms  in  the  expansion  of  the  outer  solution  (with  the 
appropriate  boundary  conditions  imposed), 

y(x)  ~  yo(x)  +  ey\(x). 

(d)  The  inner  solution  will  have  undetermined  constants.  These  constants  can  be 
determined  using  higher-order  matching  using  intermediate  variables  [60,  101] 
via  the  following  steps: 

(i)  Analogous  to  (7.12),  define  small  parameter  rj  with  ea  rj  <<C  1. 

(ii)  Define  the  intermediate  variable  x  as  x  =  (x  —  x*)/rj. 

(iii)  Use  the  relations 

,  -  v  r]- 

x  =  x*  +  rix,  X  =  — v 

to  write  the  outer  and  inner  solutions  both  in  terms  of  the  intermediate 
variable  x  and  e,rj. 

(iv)  Use  the  relations  8a  «  rj  «  1  to  expand  out  exponential  functions  or 
eliminate  small  terms  in  the  solutions  on  the  overlap  region,  with  x  =  O(l). 

(v)  Arrange  the  remaining  terms  as  an  ordered  asymptotic  expansion  involving 
rj,  8  and  determine  the  remaining  constants  through  matching  of  terms. 

(e)  Write  the  uniform  solution  valid  up  through  O(s)  terms. 
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7.7  Consider  the  problem  for  y(x)  on  0  <  v  <2  with  s  — >  0, 

9  9  dy  3 

x2  +  /  =  4-e/,  y(0)  =  -. 

(a)  Determine  the  first  two  terms  in  the  outer  solution, 

y  ~  yo  +  eyi- 

(b)  Boundary  layers  can  occur  at  v*  =0  and  v*  =  2.  For  each  case,  determine 
the  a,  ft  for  each  distinguished  limit  and  write  its  corresponding  leading  order 
equation  for  YoC^O-  Note  that  there  are  two  singular  distinguished  limits  at 
x*  =  0. 

7.8  Consider  the  problem  for  y(x)  on  0  <  x  <  1  for  e  — >  0, 


£ 


e2y 

(x-l)3’ 


y(  0)  =  o. 


y(i)  =  o. 


The  leading  order  outer  solution  is  yoM  =  4. 

(a)  Determine  the  leading  order  inner  solution  for  the  boundary  layer  at  x*  =0. 

(b)  At  x*  =  1,  there  are  two  different  distinguished  limits.  Determine  a  for  each 
and  obtain  the  respective  leading  order  equations  for  each  To(2f). 

Since  these  solutions  have  different  a9 s,  one  layer  is  nested  inside  the  other.  The 
“inner-inner”  layer  solution  should  satisfy  boundary  condition  at  x  =  1.  The 
(wider)  “intermediate  inner”  layer  should  asymptotically  match  to  the  outer  and 
inner-inner  solutions  for  the  limits  X  — >  —  oo  and  X  — >  0  respectively  in  this 
triple  deck  problem. 


Chapter  8 

Long- Wave  Asymptotics  for  PDE  Problems 


In  this  chapter  we  will  study  matched  asymptotics  and  boundary  layer  theory 
applied  to  some  classes  of  multi-dimensional  problems  for  partial  differential  equa¬ 
tions  (PDE).  Perturbation  methods  offer  an  interesting  alternative  way  to  construct 
solutions  that  will  provide  different  insight  into  the  structure  of  some  PDE  problems. 
We  will  also  see  that  matched  asymptotics  will  allow  us  to  solve  problems  that  cannot 
be  tackled  directly  by  classical  methods. 

We  will  illustrate  the  analysis  in  the  context  of  problems  for  Laplace’s  equation, 


V2U  = 


92U  92U  _ 

9X2  +  3Y2  “  °‘ 


(B.l) 


The  Laplacian  operator  V2  is  a  fundamental  element  of  equations  describing  various 
phenomena  such  as  diffusion,  wave  propagation,  and  equilibrium  potentials 


91)  ,  92U  ,  , 

—  =  V2L),  _  =  v2u,  F(X,Y)  =  V2U. 

These  equations  arise  in  electromagnetism,  heat  conduction,  mass  transfer,  fluid  flow, 
solid  mechanics,  and  many  other  areas. 

We  will  focus  on  electrostatics  as  an  example  application — in  this  case  U(X,  Y) 
represents  the  electric  potential  (voltage)  that  drives  the  flow  of  charges  and  currents 
in  a  conductor.  In  particular,  we  will  describe  the  current  flow  and  electric  charge 
density  in  a  piece  of  wire  by  solving  Eq.  (8.1)  on  a  long  slender  domain,  subject  to 
various  boundary  conditions. 


8.1  The  Classic  Separation  of  Variables  Solution 

We  begin  by  briefly  reviewing  the  traditional  separation  of  variables  approach  to 
solving  boundary  value  problems  for  Laplace’s  equation  [23,  44]. 
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For  elliptic  PDE  such  as  Laplace’s  equation,  to  uniquely  determine  a  solution,  a 
problem  must  provide  boundary  conditions  around  the  entire  boundary  of  the  domain. 
In  the  context  of  electrostatics,  Dirichlet  boundary  conditions,  where  the  value  of 
the  solution  is  specified,  correspond  to  imposing  a  known  voltage  on  the  edge  of  the 
domain,  e.g.  U  (X  =  0,  Y)  =  U.  In  contrast,  Neumann  boundary  conditions  provide 
the  value  of  the  directional  derivative  of  the  solution  normal  to  the  boundary 


dU 

dn 


=  n  •  VU, 


(8.2) 


where  n  is  the  unit  normal  vector,  perpendicular  to  the  boundary,  in  the  outward 
direction,  e.g.  n  =  — i  is  the  normal  to  the  left  boundary  of  a  domain  given  by  X  >  0, 


dll 


dn 


X=0 


-i-  (dxU,  dYU) 


dll 

dX 


=  J  00- 

x=o 


Physically,  Neumann  conditions  specify  the  current,  or  flux  of  the  solution,  out  of 
the  domain. 

Consider  Laplace’s  equation  on  a  rectangular  domain, 


92U  92U  _ 

9X2  9Y2  _  ° 


0  <  X  <  L 


0  <  Y  <  H 


subject  to  Dirichlet  boundary  conditions, 


(8.3a) 


U(X,0)=0,  U(L,  Y)  =  0,  U(0,  Y)  =  0,  U(X,  H)  =  F(X).  (8.3b) 


This  is  an  elementary  ‘building-block’  problem  where  the  form  of  the  solution  will 
be  entirely  due  to  the  one  inhomogeneous  boundary  condition.  Solutions  to  problems 
on  the  same  domain,  but  with  inhomogeneous  boundary  conditions  on  other  edges, 
can  be  built-up  from  linear  superposition  of  combinations  of  such  building  block 
solutions. 

Since  this  is  a  linear  problem,  the  overall  solution  can  be  obtained  as  a  super¬ 
position  of  linearly  independent  trial  solutions.  The  trial  solutions  can  be  sought  in 
separation  of  variables  form,  as  a  product  of  functions  of  the  independent  variables: 

oo 

U(X,Y)  =  ^c„U„(X,Y)  with  U„(X,  Y)  =  A„(X)B„(Y).  (8.4) 

//— 1 

Substituting  (8.4)  into  the  homogeneous  boundary  condition  11(0,  Y)  =  0  for  0  < 
Y  <  H  with  the  assumptions  that  the  solution  1)  is  nontrivial  (not  all  coefficients 
Cn  =  0)  and  that  the  \Jn  trial  solutions  are  linearly  independent  yields  boundary 
conditions  on  the  An(X)  functions  for  ^  =  1,  2,  3,  ...  , 


^c„A„(0)B„(Y)  =  0 


for  0  <  Y  <  H 


>  A„(0)  =  0. 
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Similarly  for  the  other  homogeneous  boundary  conditions,  U(L,  Y)  =  0  yields 
An(L)  =  0  while  U(X,  0)  =  0  gives  Bn(0)  =  0.  The  inhomogeneous  boundary 
condition  in  (8.3b)  will  be  approached  differently. 

Substituting  Un(X,  Y)  from  (8.4)  into  (8.3a)  and  requiring  each  trial  solution  to 
satisfy  Laplace’s  equation  yields 

d2An  d2Bn 

^b„(Y)  +  a„(X)^  =  o 

which  can  be  re-arranged  to  give 

A"(X)  _  B"(Y)  _  „  _  , 

A„(X)“  B„(Y)  -  S”  -  ±A”' 

Since  the  equality  must  hold  for  all  independent  values  of  X  and  Y,  both  sides  must 
take  the  same  constant  value  sw,  called  a  separation  constant ,  which  here  is  written  as 
a  sign  times  the  undetermined  positive  constant  where  the  subscript  indicates 

that  the  separation  constants  are  generally  distinct  between  different  trial  solutions. 
The  main  result  of  this  separation  of  variables  approach  is  to  split  the  PDE  into  two 
separate  ODEs  for  An(X)  and  B^(Y)  that  are  linked  only  through  Xn. 

In  order  to  make  further  progress  it  is  convenient  to  begin  by  analysing  the  ODE 
problem  satisfying  homogeneous  boundary  conditions.  In  this  case  this  selects  the 
problem  in  X-direction  for  A/Z(X).  It  can  be  shown  that  subject  to  the  boundary 
conditions,  obtaining  nontrivial  solutions  of  A"  —  SnAn  =  0  forces  Sn  to  be  negative, 
Sn  =  —Xn  [23,  44].  The  problem  for  { An ,  Xn]  is  an  eigenvalue  problem , 

A"  +  XnAn  =  0,  A„(0)  =  0,  A„(L)  =  0,  (8.5) 

yielding  an  infinite  sequence  of  oscillatory  eigenfunction  solutions 

2  2 

/nit  \  n  7i 

An(X)  =  sin  y— Xj,  Xn  =  —^j-,  n=  1,2,3,....  (8.6) 

Having  obtained  the  separation  constants,  the  ODE  for  Bn(Y)  is  now  completely 
specified, 

B';  -  ^Bn  =  0  =*  Bn(Y)  =  Ci  sinh(^fY)  +  C2  cosh(^fY).  (8.7) 

Imposing  the  Bn( 0)  =  0  boundary  condition  reduces  the  general  solution  to  Bn(Y)  = 
sinh(^Y).  The  full  solution  (8.4)  then  takes  the  form 


oo 

U(X,  Y)  =  ^c„  sinh(^Y)  sin(^fX); 


(B.8) 
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all  that  remains  is  to  determine  the  coefficients,  cn.  Applying  the  inhomogeneous 
boundary  condition  to  the  series  at  Y  =  H  yields, 


U(X,  H)  = 


oo 

^c„sinh(^fH)sin(^fX)  =  F(X) 

n=l  - 7 - ' 

Jn 


This  equation  can  be  interpreted  as  the  Fourier  sine  series  for  the  function  F(X)  (see 
Appendix  A)  and  determines  the  coefficients  in  the  expansion, 

oo  L 

Y  fn  sin(^X)  =  F(X)  =*  fn  =  r  F(X)  sin(^fX)  dX. 

n=  1  L  Jo 


Finally,  expressing  cn  in  terms  of fn,  we  obtain  the  solution  in  the  form  of  an  infinite 
series 


U(X,  Y) 


2 

Lsinh(^H) 


F(X)  sin(^fX)  JX^  sinh(^Y)  sin(^X). 

(8.9) 


8.2  The  Dirichlet  Problem  on  a  Slender  Rectangle 


Solution  (8.9)  holds  for  all  choices  of  H,  L  but  working  with  an  infinite  series  can  be 
somewhat  cumbersome.  We  will  now  see  how  a  much  more  compact,  but  equivalent 
form  of  the  solution  can  be  obtained  for  slender  rectangles,  where  the  aspect  ratio 
H/L  <  1,  with,  for  the  most-part,  a  dramatic  simplification  of  the  PDE  problem  to 
a  sequence  of  simple  ODE  problems. 

Consider  a  problem  for  Laplace’s  equation  on  the  rectangular  domain,  0  <  X  <  L, 
0  <  Y  <  H  (see  Fig.  8.1): 


a2u  a2u 

9X2  +  9Y2 


(8.10a) 


subject  to  the  boundary  conditions 


U(X,  0)  =  0,  U(X,  H)  =  U/(X/L),  (8.10b) 

U(0,  Y)  =  Ug0(Y/H),  U(L,  Y)  =  Ugj(Y/H),  (8.10c) 


where/ (x),  c>o (y).  gi(y)  are  given  functions. 
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Fig.  8.1  The  domain  for  problem  (8.10),  a  slender  rectangle  in  dimensional  coordinates  (Left)  and 
the  rescaled  dimensionless  domain  (Right) 


We  nondimensionalize  using  the  scalings 

X  =  L.\\  Y  =  Hy,  \J  =  Gu(x,y),  (8.11) 


yielding  the  scaled  problem  on  0  <  x  <  1  and  0  <T<  1, 


£  tlxx  +  Myy  —  0 

(8.12a) 

u(x,  0)  =  0,  u(x,  1)  =f(x) 

for  0  <  v  <  1 

(8.12b) 

u(0,y)  =  g0(y),  u(l,y)  =  gi(y) 

forO  <  y  <  1 

(8.12c) 

where  the  aspect  ratio,  or  slenderness  parameter  s 

=  H/L  is  small,  £  - 

■>  0,  corre- 

sponding  to  a  long,  thin  domain. 

Using  s  as  a  perturbation  parameter,  we  begin  by  seeking  an  outer  solution  of  the 
form 

u(x,  y)  ~  uo(x,y)  +  s2ui(x,y)  +  s4u2(x,y)  H - .  (8.13) 

Note  that  for  this  problem  it  is  sufficient  to  use  an  expansion  having  only  even  powers 
of  £  because  the  perturbation  parameter  appears  in  the  problem  only  as  s2  in  (8.12a). 
Substituting  into  (8.12a,  8.12b),  we  obtain  a  sequence  of  problems  which  are  each 
essentially  ODE  boundary  value  problems  in  the  y-direction  with  x  entering  only  as 
a  secondary  parameter.  At  leading  order  we  get 

uoyy  =  0,  uo(y  =  0)=0,  uo(y  =  1)  =/(*),  (8.14a) 

where  the  differential  equation  determines  uo  to  be  linear  with  respect  to  y,  uo  = 
C\y  +  C2.  The  C  coefficients  need  not  be  constants;  they  must  be  independent  of  y 
but  can  depend  on  any  other  variable(s)  present  in  the  problem,  namely  uq(x,  y)  = 
Ci(x)y  +  C2OO.  Applying  the  boundary  conditions  from  (8.14a),  we  arrive  at  the 
leading  order  solution 


uo(x,y)  =f(x)y. 


(8.14b) 
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Similarly,  at  higher  orders, 

0(e2)  :  Ulyy  =  -u0xx,  Mi(y  =  0)=0,  wi(y=l)  =  0 

=>■  ui(x,y)  =  \f"{x){y -y3). 

0(e4)  :  u2yy  =  -U\xx,  m2( y  =  0)  =  0,  u2(y  —  1)  =  0 

=>■  U2(x,  y)  =  3^0  f""(x)(ly  -  lOj3  +  3y5). 

If  the  function/ (v)  in  the  boundary  condition  at  the  top  of  the  domain  is  smooth,  this 
expansion  can  be  continued  to  all  orders  so  as  to  obtain  the  outer  solution  in  terms  of 
polynomials  in  y  times  derivatives  of f(x).  However,  this  solution  will  not  in  general 
satisfy  the  boundary  conditions  (8.12c)  at  v  =  0  and  x  =  l,  and  hence  boundary 
layer  corrections  will  be  needed. 

Noting  that  (8. 12a)  has  a  small  parameter  multiplying  the  highest  order  derivative 
in  the  v-direction,  we  recognise  it  as  a  singular  perturbation  problem  of  the  type 
treated  in  Chap.  7  and  we  seek  boundary  layers  at  v  =  0  and  v  =  1.  In  order  to 
analyse  the  structure  of  the  inner  solution  with  respect  to  x,  we  assume  a  scaled 
solution  of  the  form  (7.21), 


*=— A  u=U(X,y) 

£a 


(8.15) 


where  we  have  already  made  use  of  the  Dirichlet  boundary  conditions,  u  =  g  =  0(1) 
for  the  scaling  of  U.  Substituting  this  form  into  (8.12a)  yields 

S2~2a  Uxx  +  Uyy  =  0, 

which  selects  the  distinguished  limit  a  =  1  for  the  inner  solution  at  both  X*  =  0 
for  the  left  boundary  and  =  1  for  the  right  boundary  layer.  We  will  describe  the 
solution  for  x*  =  0  with  the  construction  for  the  right  boundary  layer  following 
analogously. 

While  the  inner  problem  with  a  =  1  has  brought  us  back  to  solving  the  full 
Laplace’s  equation, 

Uxx  +  Uyy  =  0,  (8.16a) 


there  are  subtle  changes  in  the  domain  and  boundary  conditions  that  in  this  context 
make  the  inner  problem  more  tractable  than  the  original  full  problem.  The  domain 
for  this  inner  problem  is  a  semi-infinite  strip,  0  <  y  <  1  with  X  >  0.  Since  x  =  sX 
with  X  =  0(1)  in  the  left  boundary  layer,  we  can  re-write  the  y  =  1  boundary 
condition  as 


U(X,  1)  =f(sX)  =/(  0)  +  sf(0)X  +  \s2f"{  0)X2  +  •  •  • 
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yielding  boundary  conditions  on  the  leading  order  inner  solution  as  constant  values 

U0(X,  0)  =  0,  U0(X,  1)  =/(0).  (8.16b) 

The  boundary  condition  atX  =  0  is  unchanged  from  (8.12c),  but  now  the  remaining 
boundary  condition  is  supplied  by  the  analogue  of  the  asymptotic  matching  condition 
with  the  outer  solution  (see  (7.13)) 

lim  Uo(X,y)  =  lim  u(x,y).  (8.16c) 

X^oo  x^O 

Consequently  (8.12c),  (8.16c)  and  (8.14b)  specify  the  left  and  right  boundary  con¬ 
ditions  as 

Uo(0,y)  =  g0(y),  Uo(X  oo,  y)  =f(0)y.  (8.16d) 

Noting  the  term/(0)y  from  the  outer  solution  is  present  in  both  (8.16b,  8.16d),  we 
observe  that  the  problem  can  be  expressed  in  terms  of  the  boundary  layer  correction 


function  V(X,y)  defined  by 

U0(X,y)  =  V(X,y)+f(P)y,  (8.17) 

where  V(X,y)  satisfies  the  problem 

Vxx  +  Vyy  =  0,  (8.18a) 

V(X,  0)  =  0,  V(X,  1)  =  0,  (8.18b) 

V(0 ,y)  =  go(y)  -f(0)y,  V(X  ->  oo,  y)  =  0.  (8.18c) 


This  matches  the  form  of  the  Dirichlet  “building-block”  problem  with  a  single  in¬ 
homogeneous  boundary  condition  described  in  Sect.  8.1.  Here  the  homogeneous 
boundary  conditions  determine  the  oscillatory  eigenfunctions  to  be  in  the  y-direction 
with  Xn  =  rut  yielding 


oo 

V(X,y)  =  y ^  cne~nnX  sin (nny),  (8.19a) 

/?=  1 


where 


cn  =  2  [  [go(y)  —f  (0)y]  sminny)  dy.  (8.19b) 

Jo 


We  note  that  the  e~njzX  factors  resulted  from  solving  the  ODE  A”  —  n2Tt2An  =  0 
yielding  general  solutions  An(X)  =  doen7tX  +  d\e~UJlX  and  enforcing  the  X  — >  oo 
boundary  condition  to  then  eliminate  the  un-matchable  exponentially  growing  modes. 
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In  conclusion,  the  leading  order  uniform  solution  can  then  be  composed  from  the 
outer  solution  and  the  boundary  layer  corrections, 

MOunifU,  y)  =  uo(x,  y)  +  VL(XL,  y)  +  VR{XR,  y),  (8.20) 

where  VL(XL,y)  is  given  by  (8.19a,  8.19b)  with  XL  =  x/e  and  V^(X^,y)  analo¬ 
gously  by  the  right  boundary  layer  correction  with  XR  =  (x  —  1  )/s. 


8.3  The  Insulated  Wire 


Current  flow  through  an  insulated  wire  can  be  represented  by  replacing  the  Dirich- 
let  conditions  on  the  upper  and  lower  boundaries  with  no-flux  Neumann  boundary 
conditions  (8.2).  Using  the  same  scalings  as  in  the  previous  example  (8.11),  the 
nondimensionalized  problem  on  0  <  v  <  1  and  0  <  y  <  1  takes  the  form, 


du 


u(x  =  0)  =  go  00, 


£  Uyx  +  M 


yy 


du 

—  =  0 
dy  y=l 


for 


u(x  =  1)  =  glOO 


0  <  v  <  1, 

for  0  <  y  <  1. 


(8.21a) 

(8.21b) 

(8.21c) 


This  problem  describes  current  flow  in  a  straight  insulated  wire  with  prescribed 
end- voltages. 

We  begin  by  constructing  the  outer  solution  in  the  form  u  ~  u$  +  s2u\  +  £4U2. 
The  leading  order  problem  for  uq(x,  y)  is  then  given  by 


UQyy  =  0,  UQy(y  =  0)  =  0,  uoy(y  =  1)  =  0,  (8.22) 

where  the  general  solution  of  the  differential  equation  is  again  a  linear  function  of  y 
with  v-dependent  coefficients,  uq(x,  y)  =  C\(x)y  +  C2(x).  Applying  the  boundary 
conditions  yields 

u0(x,y)  =  C2(x),  (8.23) 


and  so  the  solution  has  been  shown  to  be  independent  of  y,  but  is  given  by  some  as-yet 
undetermined  function  of  v.  Often  in  perturbation  methods,  undetermined  parts  of 
solutions  will  get  pinned  down  by  conditions  needed  for  consistency  appearing  at 
higher  orders,  or  from  matching  to  solutions  on  other  parts  of  the  domain.  At  order 
0(s2)  we  have  the  problem  for  u\  (x,  y). 


yy  -  U§xx,  U\y(x ,  0)  —  0,  U\y(x,  1)  -  0. 


(8.24) 
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Substituting  (8.23)  for  uq  into  this  problem,  we  obtain  the  general  solution 

mi  =  -\c'{(x)y2  +  C3(x)y  +  C4(x). 


Applying  the  boundary  condition  at  y  =  0  determines  C2(x)  =  0;  applying  the 
boundary  condition  at  y  =  1  yields 


d2C2 

dx2 


for  0  <  x  <  1 ; 


(8.25) 


this  is  an  ordinary  differential  equation  for  the  leading  order  solution.  Such  equations 
determining  consistency  conditions  on  lower-order  solutions,  coming  out  of  parts  of 
higher  order  problems,  are  often  called  solvability  conditions. 

Equation  (8.25)  has  a  simple  linear  general  solution, 


C2&)  =  D\x  +  D2 , 


but  in  order  to  determine  the  constants  D\  2,  we  need  to  impose  boundary  conditions. 
Our  original  problem  indeed  had  boundary  conditions  at  v  =  0  and  x  =  1,  (8.21c), 
but  for  general  imposed  functions  go  00  and  gi(y),  C2{x)  cannot  possibly  satisfy 
those  y-dependent  conditions.  Hence  we  must  construct  boundary  layers  to  satisfy 
(8.21c)  while  determining  the  effective  boundary  conditions  on  C2(x). 

Following  the  same  scaling  (8.15)  as  in  the  previous  problem,  we  determine  the 
inner  distinguished  limit  a  =  1  for  the  boundary  layers  both  at  x*  =  0  and  x*  =  1 . 
We  again  recover  Laplace’s  equation  as  the  inner  problem, 

Uxx  +  Uyy  =  0.  (8.26a) 

Analysing  the  boundary  layer  at  X*  =  0  (the  analysis  at  =  1  is  analogous),  the 
three  boundary  conditions  we  can  draw  from  (8.21b,  8.21c)  are  straightforward, 

U( 0,  y)  =  g0(y)9  Uy(X ,  0)  =  0,  Uy(X,  1)  =  0,  (8.26b) 

and  the  final  boundary  condition  is  obtained  from  asymptotic  matching  to  the  outer 
solution,  using  (8.16c), 

U(X  ->  00,  y)  =D2.  (8.26c) 

Again,  it  is  convenient  to  recast  this  problem  in  terms  of  the  boundary  layer  correction 
in  order  to  separate  out  the  overlap  from  matching,  so  we  write 

U(X,y)  =  V(X,y)+D2,  (8.27) 


with  V(X,y)  satisfying 


VXX  +  Vyy  =  0, 


(8.28a) 
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Vy(X,  0)  =  0,  Vy(X,  1)  =  0,  (8.28b) 

V(0,y)=go(y)-D2,  V(X^w,y)=0.  (8.28c) 

Applying  separation  of  variables  to  this  Neumann  boundary  value  problem  yields  the 
solution  as  a  cosine  series, 


(X) 

V(X,  y)  =  ^  cne~'mX  cos(njty),  (8.29) 

n=0 


where  the  matching  condition  eliminated  the  exponentially  growing  modes,  ennX . 
The  Cn  s  are  then  given  by  the  Fourier  coefficients  of  the  X  =  0  boundary  condition, 

[go  00  ~D2]dy,  and  cn  =  2  [  [go  00  -  D2]  cos  (rut  y)  dy  (8.30) 

Jo 


for  n  =  1,2, _ For  n  >  0  all  of  the  cne~UJlX  factors  in  (8.29)  decay  to  zero  as 

X  — ^  oo;  any  finite  values  for  cn  would  be  compatible  with  the  remaining  boundary 
condition,  that  V(X  ->  oo)  =  0.  The  only  term  that  contributes  to  the  value  of  V  for 
X  ->  oo  is  the  n  =  0  term,  the  uniform  constant,  V  (X  — >  oo)  ~  cq.  In  order  to  satisfy 
the  matching  condition  (8.28c),  we  must  have  co  =  0  in  (8.30).  This  determines  the 
boundary  condition  on  the  outer  solution  (8.25)  in  terms  of  the  average  of  the  go 
boundary  function, 

C2(0)=D2=  [  go(y)dy. 

Jo 

Analogous  analysis  of  the  boundary  layer  at  v*  =  1  yields  the  boundary  condition 
for  C2(\)  in  terms  of  the  average  value  of  gi  and  determines  the  outer  solution  as 


u0(x,y)  =  x 


8i(y)  —  8o(y)  dy 


goOO  dy 


) 


(8.31) 


with  the  leading  order  uniform  solution  taking  the  same  form  as  (8.20). 

It  is  interesting  to  compare  the  ‘information  flow’  or  ‘structural  dependence’  of 
the  perturbation  solution  for  the  past  two  problems.  For  the  Dirichlet  problem,  the 
outer  solution  can  be  determined  independently  of  the  boundary  layers,  and  sets 
matching  conditions  for  the  boundary  layers, 


BL 


BL 


Problem  (8.12): 


Outer 
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while  in  the  Neumann  problem,  the  outer  solution  cannot  be  completely  specified 
until  the  boundary  layer  solutions  have  been  calculated, 


Problem  (8.21): 


BL  ->  Outer  BL 


8.4  The  Nonuniform  Insulated  Wire 

We  now  extend  our  previous  analysis  to  consider  the  problem  of  an  insulated  wire 
whose  cross-section  is  not  of  constant  width,1  see  Fig.  8.2.  The  main  consequence  of 
this  change  is  that  the  domain  is  no  longer  separable2 3  so  the  method  of  separation  of 
variables  from  Sect.  8.1  can  not  be  used  to  construct  an  exact  solution  of  the  whole 
problem.  However,  boundary  layer  theory  and  the  method  of  matched  asymptotics 
again  works,  with  just  minor  extensions. 

We  begin  with  the  dimensional  problem  for  Laplace’s  equation  on  a  domain  with 
0  <  X  <  L  and  0  <  Y  <  H/(X/L)  where  fix)  is  a  given  function  describing  the 


width  of  the  wire, 

Uxx  +  Uyy  =  0  (8.32a) 

along  with  the  same  imposed- voltage  Dirichlet  boundary  conditions  at  X  =  0  and 
X  =  L,  but  it  is  now  notable  that  the  lengths  of  the  ends  may  differ 

U(0,  Y)  =  UgoCt/H)  forO  <  Y  <  Hf( 0),  (8.32b) 

U(L,  Y)  =  Ugi(Y/H)  forO  <  Y  <  Hf(  1).  (8.32c) 

The  lower  edge  of  the  domain  remains  straight  and  the  associated  no-flux  boundary 
condition  corresponds  to  the  first  condition  in  (8.21b) 

Uy(X,  0)  =  0  for  0  <  X  <  L.  (8.32d) 


We  must  be  a  little  more  careful  in  order  to  properly  determine  the  form  of  the  no¬ 
flux  condition,  n  •  VU  =  0  along  the  upper  boundary.  Recalling  from  multivariable 
calculus  that  the  normal  to  a  constant- value  contour  of  a  function  B(X,  Y)  is  given  by 
the  gradient  of  B,  we  construct  B(X,  Y)  =  Y  —  H/(X/L).  This  defines  our  boundary 
as  the  zero-level  contour  of  B(X,  Y).  To  obtain  a  unit  normal  vector,  we  calculate 
the  normalised  gradient: 


'This  is  sometimes  called  a  “domain  perturbation”  [61],  also  see  Hinch  [47]. 

2  A  separable  domain  being  expressible  as  Cartesian  products  of  intervals  in  independent  variables, 
for  example  (X  e  [0,  L])  x(Ye  [0,  H]). 

3  Sometimes  called  a  level  set. 
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Fig.  8.2  The  domain  for  the 
non-uniform  insulated  wire, 
problem  (8.32) 


VB 

VB 


,  (— (H/D/'CX/L),  1) 

A  +  (H/L)2/'(X/L)2  ’ 


where  the  zb  sign  is  chosen  so  as  to  correspond  to  the  unit  outward  normal,  pointing 
out  of  the  domain  at  the  boundary  (in  this  case  +  for  the  upper  boundary).  Finally, 
taking  the  dot  product  with  the  gradient  VU  yields  the  condition 


H  .  91) 

L/(X/L)3X 


+ 


(X,H/  (X/L)) 


9U 

9Y 


=  0. 


(8.32e) 


(X,H/  (X/L)) 


Employing  the  previously  used  scalings  (8.1 1)  we  arrive  at  the  nondimensional  prob¬ 
lem  on  0  <  v  <  1  and  0  <  y  </M, 

£  i^xx  H-  Uyy  —  0,  (8.33a) 

subject  to  the  boundary  conditions 

u(0,y)=go(y)  0  <y  </(0),  (8.33b) 

u(l,y)=gi(y)  0  <y  </(l),  (8.33c) 

and  for  0  <  v  <  1 

uy(x ,  0)  =  0,  uy(x,f(x ))  —  £2ff(x)ux(x,f(x))  =  0.  (8.33d) 


Following  as  in  Sect.  8.3,  we  begin  by  constructing  the  outer  solution,  and  at  leading 
order  obtain  the  problem, 


UQyy  =  0,  U0y(X ,  0)  =  0,  UQy(X,f(x))  =  0, 


(8.34) 
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yielding  uq(x,  y )  =  C2OO  as  in  (8.23).  To  determine  C2C*:)  we  look  to  the  next  order 
problem, 

U0xx  +  u\yy  =  0,  u\y(x,  0)  =  0,  u\y(x,f)  =  f  (*) u0x (x , f) .  (8.35) 

Substituting  in  for  uo ,  we  find  the  general  solution, 

m(x,y)  =  - \C2{x)y 2  +  C3(x)y  +  C4(x).  (8.36) 

Imposing  the  y  =  0  boundary  condition  determines  C3  =  0.  The  top  boundary 
condition  at  y  =  fix)  then  takes  the  form 

uiy(x,f(x))  =  f(x)C'2(x); 

equating  this  expression  with  the  y-derivative  of  the  general  u\(x,  y )  solution  given 
by  (8.36)  evaluated  at  y  =f  yields  the  compatibility  condition 

-C"(*)/M  =f\x)C'2(x). 


Using  the  product  rule,  this  result  can  be  expressed  more  compactly  as 


d 

dx 


(8.37) 


this  is  a  generalisation  of  (8.25)  for  non-constant/ (x) .  Boundary  conditions  for  (8.37) 
are  then  determined  by  solving  for  the  boundary  layers  at  x*  =  0  and  x*  =  1  using 
separation  of  variables  as  was  done  in  Sect.  8.3  to  construct  the  uniform  solution  on 
the  whole  domain  (8.20). 


8.5  Further  Directions 

The  main  asymptotic  reduction  considered  in  this  chapter  is  based  on  having  the 
aspect  ratio  as  the  limiting  small  parameter,  s  =  H /L  ^  0.  Stemming  from  the 
historical  uses  of  this  approach  in  the  context  of  many  problems  in  fluid  dynamics 
[1,61]  (involving  the  flow  of  oil  between  machine  parts,  and  other  thin  layers  like  tears 
films  on  the  eye),  these  types  of  problems  are  sometimes  called  lubrication  models 
[51,  79,  96].  They  offer  valuable  simplifications  to  problems  by  reducing  systems  to 
lower-dimensional  domains  (in  our  case  from  a  PDE  on  a  two-dimensional  region 
to  an  ODE  on  a  one- dimensional  interval)  by  separating  out  the  “thin”  directions 
to  leave  an  underlying  governing  model  on  the  “wide”  spatial  directions  with  the 
appropriate  boundary  conditions  incorporated.  These  problems  are  also  sometimes 
called  slender  body  asymptotics ,  and  reduced- dimension  models. 
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8.6  Exercises 


8.1  Consider  the  problem  for  Laplace’s  equation  on  0  <  X  <  L  and  0  <  Y  <  H, 


Uxx  +  UYY  =  0, 


with  boundary  conditions 

U(0,  Y)  =  0,  U(L,  Y)  =  0  for  0  <  Y  <  H, 

_  ^-3X/L  for  0  <  X  <  L. 

Y=H 

(a)  Use  separation  of  variables  to  construct  the  exact  solution  U  (X,  Y)  (valid  for  any 
values  of  H,  L  >  0). 

(b)  Nondimensionalize  the  problem  and  consider  the  slender  limit  of£  =  H/L^0 
to  find  the  leading  order  outer  solution  uo(x,  y). 

(c)  Rescale  the  solution  from  (a)  by  the  natural  lengthscales  and  show  that  it  is 
equivalent  to  the  solution  from  (b)  in  the  limit  s  — >  0. 

8.2  Consider  the  problem  for  Laplace’s  equation  on  0  <  X  <  L  and  0  <  Y  <  H  in 

the  limit  that  H  ->  0  with  L  =  0(1), 


U(X,0)  =  0, 


au 

ay 


Uxx  +  UYy  =  0, 


with  boundary  conditions 

U(0,  Y)  =  sin  (fffi,  U(L,  Y)  = -(Y/H)2  for  0  <  Y  <  H, 

U(X,  0)  =  sin  (^V  U(X,  H)  =  cos  (^*)  for  0  <  X  <  L. 

(a)  Determine  the  leading-order  outer  solution. 

(b)  Determine  the  boundary  layer  corrections  for  the  left  and  right  leading-order 
inner  solutions. 

(c)  Determine  the  leading  order  uniformly  valid  solution. 

(d)  Use  (c)  to  show  that  the  boundary  layer  is  crucial  for  calculating  the  leading 
order  value  of  the  average  flux  at  the  right  end  of  the  domain: 


Hint:  Your  solution  will  involve  a  sum  given  by  the  Riemann  zeta  function, 
1/(2* +1)3  =  |?(3)«  1.0518. 
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8.3  Consider  the  problem  for  Laplace’s  equation  on  the  non-uniform  slender  domain, 
0  <  X  <  L  and  0  <  Y  <  F(X)  where  F(X)  =  15  +  5cos(37rX/L)  in  the  limit 
L  ->  oo  with  H  =  1 , 

UXX  +  Uyy  =  0, 


with  boundary  conditions 


U(0,  Y) 


au 

aY 


=  0, 

Y=0 


U(L,  Y)  =  3Y2 

for 

0  <  Y  <  F(X), 

n  -  VU 

=  0 

for 

0  <  X  <  L. 

Y=F(X) 


(a)  Nondimensionalize  the  problem. 

(b)  Write  the  equations  for  the  outer  solution  up  to  0(s2).  Determine  the  first  two 
terms  in  the  expansion  of  the  outer  solution. 

(c)  Determine  the  boundary  layer  corrections  for  the  left  and  right  leading-order 
inner  solutions.  Hint:  What  is  yBL  in  0  <  y  <  yBL  in  each  boundary  layer? 

(d)  Use  matching  to  determine  the  v-boundary  conditions  for  the  outer  solution  and 
then  obtain  the  leading-order  outer  solution. 

8.4  Consider  the  problem  for  Laplace’s  equation  for  a  curved  semicircular  arc  of 
wire  of  varying  width  on  0  <  6  <  tx  with  R  —  W/(0)  <  R  <  R  +  W/(0)  in  the 
limit  of  a  large  radius  of  curvature,  R  — oo, 


a2u  i  au  i  a2u 

9R2  +  R9R  +  R2  902 

with  boundary  conditions 


n  -  VU 


R=R-W/(0) 


=  0, 


U(R,0)  =  Ugo(R/R), 


n  - VU 


=  0 

R=R+W/(<9) 


U(R,  n)  =  Ugi(R/R) 


for  0  <  9  <  7T, 
for  0  <  X  <  L. 


(a)  Nondimensionalize  the  problem  using  R  =  Rr,  but  show  that  this  does  not  yield 
a  scaled  problem  with  a  small  parameter. 

(b)  Let  r  =  1  +  sy.  What  should  be  used  for  the  aspect  ratio  el  Write  the  complete 
problem  satisfied  by  u(y,  6). 

(c)  Write  the  problems  for  the  outer  solution,  u(y,  0)  ~  uo (y,  0)  +  £u\(y,  0)  + 
e2u2(y,  9). 

8.5  The  derivation  of  the  Korteweg  de  Vries  (KdV)  equation  for  waves  on  the  surface 
of  shallow  layers  of  water  follows  from  similar  asymptotic  reductions  of  Laplace’s 
equation  with  appropriate  boundary  conditions  [31,  58,  80].  Consider  the  following 
nondimensionalized  problem  for  a  time-dependent  potential  <p(x,y,t)  and  the  shape 
of  waves  on  the  top- surface  of  a  fluid  layer,  F(x,  t),  on  —  oo  <  x  <  oo, 
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£  4>xx  T  0yy  —  0 

onO  <  y  <  1  +  s2F 

(8.38a) 

ii 

O 

at  y  =  0 

(8.38b) 

<Pt  +  l(e2<A?  +  <Py )  +  F  ~  0 

aty  =  1  +  s2F 

(8.38c) 

s2Ft  +  s4(j)xFx  =  4>y 

at  y  =  1  +  £2F 

(8.38d) 

The  first  steps  in  the  derivation  are: 

(a)  Expand  0  =  0 q  +  £20i  +  £402  +  0(s6)  and  F  =  Fo  +  £2F\  +  0(s4)  for  s  — >  0 
and  use  (8.38a,  8.38b)  to  determine  0o,  01 , 02  in  terms  of  polynomials  in  y  and 
three  un-determined  functions  Co(x,  t),  C i,  (x,  t),  C2(x,t). 

(b)  Determine  the  leading  order  equations  for  Co,/o  obtained  from  substituting  the 
expansions  for  0,  F  into  (8.38c,  8.38d). 

(c)  Determine  the  equations  for  the  0(s2)  next-order  equations  obtained  from 
(8.38c,  8.38d).  (To  be  concluded  in  Exercise  9.12.) 

8.6  The  derivation  of  the  porous  medium  equation ,  introduced  earlier  as  Eq.  (5.21), 

dh  _  i  d  /  3dh\ 
dt  3  dx  \  dx)  ’ 

follows  from  a  long-wave  analysis  for  the  evolution  of  the  height  of  a  layer  of  a 

viscous  fluid  y  =  h(x,  t)  [1,  79].  Consider  the  e  — >  0  limit  of  the  problem 


T  Uyy  -  Hx 

on  0  <  y  <  h 

(8.39a) 

Ux  +  Vy  ——  0 

on  0  <  y  <  h 

(8.39b) 

u  =  v  =  0 

aty  =  0 

(8.39c) 

Uy  =  0 

aty  =  h 

(8.39d) 

ht  +  uhx  =  v 

at  y  =  h 

(8.39e) 

where  u(x,  y,  t),  v(x,  y,  t)  are  the  horizontal  and  vertical  components  of  the  fluid 
velocity  respectively  and  the  last  three  equations  are  boundary  conditions  on  the  first 
two. 

(a)  Show  that  by  using  (8.39b,  8.39c)  to  express  v  in  terms  of  u,  Eq.  (8.39e)  can  be 
written  as 

dh  a  /  fh 

a7  +  toU 

(b)  Determine  the  leading  order  horizontal  velocity,  uo(x ,  y,  t)  in  terms  of  h  from 
(8.39a,  8.39c,  8.39d)  to  then  obtain  the  porous  medium  equation  from  (8.40). 


=  0. 


(8.40) 
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8.7  Note  that  in  all  of  the  problems  considered  here  we  assumed  a  perturbation 
expansion  for  the  outer  solution  given  in  powers  of  s2,  (8.13).  Sometimes  this  must 
be  modified.  Consider  problem  (8.12)  with  the  Dirichlet  condition  u(x,  1)  =ion  the 
top  boundary.  Examine  the  expansion  of  the  boundary  layers  to  show  that  if  we  wish  to 
get  a  solution  that  is  uniformly  valid  to  0(s2)  then  the  expansion  u  ~  uq  +  su\  +  s2U2 
is  needed. 


Chapter  9 

Weakly-Nonlinear  Oscillators 


Many  systems  function  through  repeated  cycles  of  operation — from  the  spinning  of 
gears  in  a  machine,  the  physiology  of  heartbeats,  biological  behaviour  on  the  24  h 
circadian  cycle,  to  seasonal  climate  changes  during  the  motion  of  the  earth  around 
the  sun  each  year.  The  most  basic  models  for  such  systems  take  the  form  of  oscillator 
equations,  having  regular  predictable  periodic  solutions;  the  simplest  such  model  is 
the  linear  oscillator  equation, 


d2x 
dt 2 


+  COqX 


(9.1) 


with  natural  frequency  co o.  It  has  the  general  solution  x(t)  =  A  sin(coot)  + 
B  cos(&>oO>  with  the  constants  A  and  B  being  determined  by  the  initial  conditions 
imposed  on  the  system. 

More  detailed  models  of  oscillatory  systems  include  additional  terms  that  describe 
other  influences  that  modify  the  solutions.  If  these  effects  are  weak,  as  indicated  by 
a  dimensionless  system  parameter  being  small,  then  the  model  can  be  written  as 


d2x 
dt 2 


+  tL>lx  =  sf  (x,  ^ ,t )  , 


(9.2) 


Such  models  are  called  weakly -nonlinear  oscillators  since  they  reduce  to  the  linear 
oscillator  (9.1)  when  the  perturbation  terms  (potentially  including  nonlinearities)  are 
suppressed.  In  the  context  of  mechanical  systems,  Eq.  (9.2)  can  describe  the  small- 
amplitude  motion  of  a  pendulum,  or  a  mass  attached  to  a  nonlinear  spring. 

In  the  framework  of  the  earlier  chapters  on  perturbation  methods,  (9.2)  may  appear 
to  be  a  straightforward  regular  perturbation  problem,  but  we  will  see  that  regular 
perturbation  expansions  will  not  be  able  to  address  the  question  of  greatest  interest 
for  oscillating  systems,  which  is, 

If  we  understand  the  behaviour  of  the  system  for  a  single  cycle  of  the  oscillation,  can  we 
determine  how  the  perturbation  forcing  terms  cumulatively  affect  the  problem  over  long 
times  and  many  oscillation  periods? 
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Two  perturbation  methods  will  be  described  to  illustrate  how  weak  influences  can 
be  incorporated  into  the  leading  order  solution  to  obtain  more  accurate  long-time 
predictions  of  oscillatory  behaviour. 


9.1  Review  of  Solutions  of  the  Linear  Problem 


We  begin  by  briefly  reviewing  the  essential  results  for  the  linear  oscillator  equation 
that  form  the  basis  for  the  perturbation  methods  for  (9.2).  As  discussed  above,  the 
unforced  linear  oscillator  equation  is  characterised  by  a  natural  frequency  coo  >  0 
and  has  the  general  homogeneous  periodic  solution  Xh(t), 


d2x 
dt 2 


+  COqX 


=  0 


>  Xh(t)  =  A  sin(&>oO  +  Bcos(coot).  (9.3) 


Now  consider  the  inhomogeneous  version  of  this  equation,  driven  by  a  periodic 
forcing  function  fit). 


x"  +  COqX  =  fit). 


The  function  /  can  be  directly  replaced  by  its  Fourier  series,  f(t)  =  Tk  Ck  sin(y*r)  + 
Dk  cos (ykt),  and  then  by  the  linearity  of  the  equation,  the  overall  solution  will  be 
the  sum  of  the  contributions  from  each  of  the  terms  in  the  series  for  /,  yielding 
x(t)  =  xh(t)  +  yy  Xk(t).  Each  Fourier  term  yields  a  problem  of  the  form 

x'k  +  colxk  =  Ck  sin(y&0  +  Dk  cos (ykt).  (9.4) 


If  yk  fz  coo  then  the  general  solution  is  given  as  a  combination  of  homogeneous  and 
particular  solutions  as 


Xk(t)  =  A  sin(&>oO  +  B  cos(coot)  + 


Ck 


Homogeneous  solution 


"o  -Yk 


sin  (ykt)  + 


Dk 


~  Yk 


cos(ykt) 


Particular  solution 


The  particular  solution  can  be  obtained  from  the  method  of  undetermined  coefficients 
or  other  elementary  approaches.  This  solution  is  not  valid  for  yk  =  coo,  which  is 
called  the  case  of  resonant  forcing  (forcing  at  a  natural  frequency  of  the  system). 
For  resonant  forcing,  the  method  of  undetermined  coefficients  suggests  a  different 
form  for  the  particular  solution,  xp(t)  =  c\t  sin(&>oO  +  C2 1  cos  (two  0-  Substituting 
this  into  (9.4)  and  matching  coefficients  of  corresponding  terms  yields  the  general 
solution  as 

C  D 

Xatoit)  =  A  sin(<woO  +  B  cos(coot)  H - 1  cos(coot)  H - 1  sin(u;oO  • 

- v - '  2coo  2coo 

Homogeneous  solution  ^  V  ^ 

Resonant  forced  response 
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Fig.  9.1  Solutions  of  the 
forced  linear  oscillator 
equation  (9.4)  with  coo  =  1: 
the  non-resonant  solution  for 
Yk  —  2  and  resonant  solution 
for  yk  =  1 


The  resonant  response,  while  being  oscillatory  with  period  In /coo,  is  notable  for 
having  an  amplitude  that  exhibits  unbounded  growth  with  increasing  time.  The  terms 
t  cos  coot  and  t  sin  coot  are  commonly  called  secular  growth  terms  (see  Fig.  9.1).  We 
will  see  that  the  occurrence  of  such  terms  is  the  central  issue  that  must  be  addressed 
in  constructing  accurate  long-time  asymptotic  solutions  for  perturbed  oscillators. 


9.2  The  Failure  of  Direct  Regular  Expansions 

In  order  to  illustrate  the  shortcomings  of  the  standard  regular  perturbation  expansion 
approach,  we  consider  two  simple  perturbed  linear  oscillator  problems  in  the  limit 
s  — >  0,  namely 

x" x  =  —  sx,  v(0)  =  1,  xr(0)  =  0,  (9.5) 

and 

x"  +  x  =  —sx',  v(0)  =  1,  x^O)  =  —\s-  (9.6) 

Assuming  the  solutions  to  be  perturbation  expansions  of  the  form 

x{t)  =  Xo(t)  +  £X\ (t)  +  £2  X2  (0  +  •  •  •  ,  (9.7) 

yields  an  ordered  sequence  of  initial  value  problems  for  each  xn(t)  with  the  leading 
order  for  both  (9.5)  and  (9.6)  reducing  to  (9.3)  with  <^>o  =  1  *  Solving  these  sequences 
of  sub-problems  up  to  0(s2),  yields  the  solution  of  (9.5)  as 

x(t)  ~  cos  t  —  ^£t  sin  t  +  s 2  sin  t  —  t2  cos  ,  (9.8) 

while  the  solution  of  (9.6)  is  given  by 

x(t)  ~  cos  t  —  \ st  cos  t  +  ^ s 2  it  sin  t  +  t2  cos  tj  .  (9.9) 
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While  these  two  expansions  appear  quite  similar  in  form,  this  masks  the  fact  that 
they  come  from  problems  predicting  very  different  behaviours.  Problem  (9.5)  can  be 
re-written  in  the  form  (9.3)  as  x"  +  (1  +  s)x  =  0  and  it  is  straight  forward  to  show 
that  its  exact  solution  is 

*exact(0  =  COS(VT+”e  t).  (9.10) 

In  other  words,  the  exact  solution  has  a  slightly  perturbed  natural  frequency,  coo  = 
Vl  +  £,  compared  to  coo  =  1  for  the  leading  order  problem,  x'q  +  xo  =  0.  In  contrast, 
(9.6)  is  a  weakly  damped  linear  oscillator  of  the  form  x"  +  ex'  +  x  =  0,  and  its  exact 
solution  is  given  by 


Xexact (t)  =  e  £t/1  COS (J 1  -  \s2  t^.  (9.11) 

The  respective  periodic  and  decaying  oscillatory  behaviours  of  these  two  solutions 
are  shown  in  Fig.  9.2  along  with  plots  of  the  expansions  (9.8)  and  (9.9).  While  the 
expansions  match  their  corresponding  solutions  for  early  times,  both  dramatically 
diverge  when  their  later  time  behaviours  become  dominated  by  the  secular  growth 
terms. 

While  these  results  might  suggest  that  the  perturbation  expansions  have  produced 
incorrect  descriptions,  we  should  not  dismiss  them  too  soon.  In  fact,  taking  the  Taylor 
series  expansions  of  (9.10)  and  (9.11)  for  s  — >►  0  at  fixed  finite  times  ( t  =  0(1)) 
directly  reproduces  (9.8)  and  (9.9).  Equations  (9.8)  and  (9.9)  are  indeed  correct,  but 
they  must  be  used  with  caution. 

Applying  the  fundamental  assumption  of  asymptotic  ordering  to  the  terms  in  (9.7), 
the  expansions  are  valid  only  when 

1*0(01  >  |e*i(0l  >  |£2*2(0I  >  •  ,  £  ->  0.  (9.12) 


Fig.  9.2  Exact  solutions  and  regular  expansions  for  problem  (9.5):  (9.10)  and  (9.8)  (Left)  and  for 
problem  (9.6):  (9.11)  and  (9.9)  (Right),  both  with  s  =  1/5 
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Considering  the  first  two  terms  from  (9.8),  |  cos  t\  |  \st  sin  t  |  holds  only  if  1  st. 
In  other  words,  the  expansions  can  only  be  expected  to  hold  while  this  product  is  small 
(“for  a  limited  time  only”  is  an  appropriate  phrase,  here  applying  for  0  <  t  <<C  1  /s). 

These  examples  are  be  a  bit  troubling — we  have  worked  out  higher  order  terms  in 
the  asymptotic  expansions  since  the  leading  order  solution  by  itself,  xo(t)  =  cos  t, 
does  not  capture  the  differences  that  distinguish  these  two  problems  from  each  other. 
Yet  the  additional  terms  yield  contributions  that  restrict  the  validity  of  the  expansion 
to  relatively  short  times,  when  effects  like  slow  growth  or  decay  or  changes  in  the 
oscillation  frequency  have  not  yet  fully  developed.  This  suggests  the  need  for  other 
forms  of  perturbation  expansions  that  can  overcome  these  limitations. 


9.3  Poincare-Lindstedt  Expansions 


The  failure  of  the  regular  expansion  to  describe  periodic  motion  on  long  timescales  in 
the  context  of  astronomy  (for  the  motion  of  the  planets)  motivated  the  development  of 
an  improved  approach  known  as  the  Poincare-Lindstedt  method.  A  key  idea  behind 
the  method  is  that  the  regular  perturbation  expansion  is  too  restrictive  in  its  form 
and  does  not  allow  for  the  possibility  that  the  solution  may  have  a  frequency  that  is 
shifted  from  the  leading  order  natural  frequency  coo  (as  in  (9.10)). 

The  Poincare-Lindstedt  approach  begins  with  a  change  of  variables  in  terms  of  a 
frequency  that  can  depend  on  the  perturbation  parameter, 

x(t)  =  x(0)  with  0  =  Q{e)t,  (9.13) 

such  that  12(0)  =  coo.  We  then  seek  regular  perturbation  expansions  in  terms  of  the 
new  unknowns, 


x(0)  —  xo (O')  T  sx\(0}  T  •  •  •  ,  if 2(0 )  —  coo  T  £co\  -F  •  •  •  .  (9.14) 

We  will  now  give  an  example  to  illustrate  how  this  seemingly  minor  change  allows  the 
Poincare-Lindstedt  method  to  eliminate  secular  growth  terms  and  obtain  perturbation 
expansions  that  are  valid  over  longer  times  for  some  problems. 

Consider  a  problem  similar  to  (9.5), 

*"  +  4x  =  -£x,  x(0)  =  3,  *'(0)  =  -8.  (9.15) 


After  the  change  of  variables  (9.13),  we  arrive  at  the  modified  problem  for  x(0), 


Q 


crx 

dO2 


+  4x  =  —£x,  x(0)  =  3, 


Q 


dx 

dO 


=  -8. 


(9.16) 


6=0 
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Substituting  expansions  (9.14)  into  (9.16)  and  separating  by  orders  of  s  yields,  at 

0(s°), 


,  d2xo 

»»^r  +  4«  =  o. 


*o(0)  =  3, 


=  -8, 

9=0 


and  at  0(sl), 


d2x  o 

+  4xi  =  -*o  -  2coqcoi  —^qY  ’ 


xi(0)  =  0,  &>o-£i(0)  =  —cdiXq(0), 


(9.17a) 


(9.17b) 


and  so  on  at  higher  orders.  Note  that  lower-order  terms  from  the  expansion  of  the 
solution  (xk  for  k  =  0,  1,  . . . ,  n  —  1)  should  be  shifted  to  the  right-hand  side  of  the 
equation  for  xn  and  be  treated  as  known  parts  of  the  inhomogeneous  forcing. 

Identifying  the  natural  frequency  as  coo  =  2  from  (9.15)  with  s  =  0,  the  leading 
order  solution  can  then  be  obtained  from  (9.17a)  as 


xq(0)  =  — 4sin0  +  3  cos  0. 


(9.18) 


Substituting  these  results  into  the  0(e )  problem  (9.17b)  yields 


4 


+  4x\  =  [16co  i 


4]  sin#  +  [—l2coi  +  3]  cos0, 


(9.19) 


x\ (0)  =  0,  2x[(0)  =  4co\. 

At  this  point  co\  is  an  undetermined  constant.  Problem  (9.19)  can  be  solved  for  any 
value  co\  to  obtain  x\ (t 9 ).  However,  noting  the  presence  of  the  resonant  forcing  terms, 
sin#  and  cos 0,  on  the  right-hand  side  of  the  equation,  the  solution  would  include 
secular  growth  terms.  But,  if  those  resonant  terms  in  (9.19)  were  eliminated  by  an 
appropriate  choice  of  oo\,  then  x\  (0)  would  be  bounded  and  vo  +  sx\  would  remain 
asymptotically  well-ordered  for  all  times.  For  this  problem,  this  criterion  selects 
co i  =  1/4.  This  choice  yields  x\  (0)  =  ^  sin  0,  and  then  using  (9.13)  and  (9.14)  the 
solution  of  the  original  problem  can  be  written  as 

x(t)  ~  (—2  +  ^s)  sin([2  +  \s]t)  +  3  cos([2  +  \e\t).  (9.20) 

This  solution  holds  over  0  <  t  <<C  1  /s1',  compare  this  with  (9.8),  which  was  valid 
only  over  0  <  t  <<C  1/e,  see  Fig.  9.3.  In  fact,  even  without  going  through  the  work  of 
solving  for  iq  (0),  the  Poincare-Lindstedt  approach  has  yielded  an  improved  solution 
by  determining  oo\  through  suppressing  the  resonant  terms  in  the  0(e)  equation;  the 
condition  onwi  is  another  example  of  a  solvability  condition. 
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Fig.  9.3  Comparison  of  the 
regular  expansion  and  the 
Poincare-Lindstedt  leading 
order  solution  against  the 
exact  solution  of  (9.15). 
Slight  shifts  between  the 
Poincare-Lindstedt  and 
exact  solution  are  visible  for 
larger  times 


While  the  example  above  was  a  linear  problem,  the  Poincare-Lindstedt  method 
extends  directly  to  nonlinear  equations  with  perturbation  terms  of  the  form  sf(x). 
However,  readers  are  right  to  suspect  that  the  approach  has  some  limitations.  Noting 
that  the  general  solution  of  the  leading  order  problem  has  two  independent  solution 
terms,  vo  =  AsinO  +  BcosO,  and  that  each  leads  to  resonant  forcing  in  (9.19), 
it  should  be  a  little  surprising  that  the  coefficients  of  both  resonant  terms  can  be 
zeroed  using  only  one  degree  of  freedom,  co\.  In  fact,  this  is  only  possible  when  x(t) 
is  a  periodic  solution ,  leading  to  a  degenerate  linear  system  for  the  coefficients  of 
the  resonant  forcing  terms.  Hence,  the  Poincare-Lindstedt  method  can  only  be  used 
to  obtain  periodic  solutions,  and  cannot,  for  example,  generate  the  slowly  decaying 
solution  (9. 1 1).  To  overcome  this  limitation,  we  consider  another  related  perturbation 
method. 


9.4  The  Method  of  Multiple  Time-Scales 

Like  the  Poincare-Lindstedt  method,  the  method  of  multiple  time-scales  (MMTS) 
determines  solutions  to  perturbed  oscillators  by  suppressing  resonant  forcing  terms 
that  would  yield  spurious  secular  terms  in  the  asymptotic  expansions.  The  method  of 
multiple  time-scales  makes  a  less  restrictive  assumption  on  the  form  of  the  solution 
than  employed  by  the  Poincare-Lindstedt  method;  it  assumes  that  the  solution  can 
be  expressed  as  a  function  of  multiple  (for  our  purposes,  just  two)  time  variables, 

x(t)  =  X(t ,  r),  (9.21) 

where  t  is  the  regular  (or  “fast”)  time  variable  and  r  is  a  new  variable  describing  the 
“slow-time”  dependence  of  the  solution.  In  a  physical  context,  t  could  represent  a 
circadian  rhythm  of  a  daily  cycle,  while  r  might  describe  changes  to  this  cycle  that 
are  only  noticeable  over  the  timespan  of  years. 
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The  simplest  approach  to  determining  the  choice  of  the  slow  time  variable,  is  to 
identify  the  combinations  of  s  and  t  present  in  secular  terms  in  the  regular  expansion. 
In  the  examples  from  Sect.  9.2,  r  =  st,  so  we  focus  on  this  case. 

The  first  step  is  to  perform  the  change  of  variables  (9.21),  where  by  using  the 
chain  rule,  we  arrive  at 


dx  dX  dXdx  dX  dX 

—  —  —  + - —  —  +  £  — , 

dt  dt  dr  dt  dt  dr 

and  similarly, 

d1 2x  d2X  d2X  ?d2X 

- 7T  —  - 7T  +  2S -  “h  £  7T  . 

dt 2  dt 2  dtdr  dr2 

The  autonomous  weakly-nonlinear  oscillator  equation 


d2x 

dt2 


(9.22) 


(9.23) 


(9.24) 


then  becomes 


d2X  d2X  9  d2X 

X7T  +  - 1"  6  ~X~T  +  %  =  sf  ( X ,  Xt  +  sXT) 

dtz  dtdr  drz 

which  can  be  rearranged  into  a  more  useful  form  as 

g.  +  X  =  s(f(X,X,  +  eX,)-  2X„)-'>X„.  (9.25) 

We  still  have  a  perturbed  linear  oscillator,  but  now,  we  must  specify  that  the  oscilla¬ 
tion  is  with  respect  to  the  fast  time  variable  t,  and  we  note  that  the  right-hand  side 
perturbation  involves  additional  terms  and  derivatives  with  respect  to  t  and  r .  It  may 
seem  peculiar  to  replace  the  simpler  ODE  (9.24)  by  a  more  complicated  partial  dif¬ 
ferential  equation,  but  (9.25)  can  still  be  solved  through  application  of  ODE  methods 
and  it  provides  the  degrees  of  freedom  necessary  to  properly  describe  the  system  over 
longer  times. 

The  next  step  is  to  expand  the  MMTS  solution  as  a  regular  perturbation  expansion, 

X(t,  r)  =  x0(t,  r)  +  sX\{t,  r)  +  0(s2),  (9.26) 

and  substitute  into  (9.25).  Analogous  to  the  Poincare-Lindstedt  method,  we  want  to 
suppress  any  resonant  terms  that  occur  in  order  to  determine  the  currently  unspecified 
parts  of  the  solution.  The  final  step  is  then  to  reconstruct  the  relationship  between 
fast  and  slow  times,  say  r  =  et ,  to  obtain  the  final  solution  as  x(t)  ~  Xo(t,  st). 


1  Other  problems,  and  additional  timescales  needed  for  higher  order  expansions  could  involve  higher- 

order  (slower)  timescales  such  as  Tk  =  skt  for  k  =  2,  3, . . .. 
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As  an  example,  we  consider  a  damped  nonlinear  oscillator, 
jc  djc 

— 7T  +  x  =  —  sk - b£x3,  v(0)  =  1,  x(0)  =  —2,  (9.27) 

dtz  dt 

for  s  — >  0.  Setting  r  =  st,  and  following  the  method  of  multiple  time-scales 
decomposes  the  problem  into  the  leading  order  problem 

XQtt  +  X0  =  0,  Xo(0, 0)  =  1,  Xo,(0, 0)  =  -2,  (9.28a) 

and  at  0(s), 

X0tt  +  X0  =  -KX0t+xl~2X0tT,  (9.28b) 

*i(0,0)  =  0,  Xi,(0,0)  =  -Xot(O.O). 

With  respect  to  the  fast-time  variable  t,  (9.28a)  has  a  solution  that  is  a  linear  com¬ 
bination  of  sinf  and  cost.  However,  unlike  (9.3),  since  other  (slow  time)  variables 
appear,  the  coefficients  in  the  linear  combination  are  not  constants,  but  are  functions 
dependent  on  the  slow-time  variable  r, 

Xo(t ,  r)  =  A(t)  sin  t  +  B(r)  cos  t.  (9.29) 

From  the  initial  conditions  on  Xo  at  t  =  r  =  0,  we  find  that 

A(0)  =  -2,  5(0)  =  1,  (9.30) 

but  otherwise,  A(r)  and  B(x)  are  as-yet  undetermined  functions. 

Moving  on  to  the  0(e)  problem  (9.28b),  after  substituting  in  (9.29),  we  obtain 

XUt  +Xl=  (-kA  +  \A2B  +  \B3  -  2^j  cos t 

+  (kB  +  |A3  +  \AB2  +  2^^  sinf 

+  O#3  -  I A2b\  cos(3 t)  -  Fa3  -  | AB2\  sin(3r).  (9.31) 

Note  that  many  of  the  terms  on  the  right-hand  side  of  (9.31)  are  a  consequence  of 
the  nonlinear  term  Xjj.  The  forcing  terms  must  be  expanded  out  as  the  sum  of  sines 
and  cosines  of  the  fast  time  scale  with  coefficients  that  can  only  depend  on  the  slow 
time  scale.  For  simple  nonlinear  products,  like  Xq,  this  can  done  be  through  the 
use  of  trigonometric  identities  (see  Appendix  A).  For  more  complicated  types  of 
nonlinear  forcing  terms,  the  right  side  of  (9.25)  should  be  replaced  by  its  Fourier 
series  expansion  (also  see  Appendix  A).  These  Fourier  series  decompositions  are 
essential  in  separating  out  the  resonant  and  non-resonant  forcing  terms.  For  (9.31), 
cos  t  and  sin  t  are  resonant  terms,  while  cos(3 1),  sin(3f)  are  non-resonant. 
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Fig.  9.4  Comparison  of  the 
regular  expansion  and  the 
method  of  multiple 
time- sc  ales  leading  order 
solution  against  the  exact 
solution  of  (9.27) 


To  eliminate  the  resonant  terms  that  would  cause  expansion  (9.26)  to  break-down, 
we  set  the  respective  coefficients  to  zero, 

o  o  o  o  d  A  o  o  o  o  d  B 

-kA  +  \A2B  +  \b3  -  2 —  =0,  kB  +  \A3  +  lAB2  +  2 —  =0.  (9.32) 

4  4  dr  4  4  dr 

These  two  equations  are  the  solvability  conditions  for  this  problem.  They  are  coupled 
ordinary  differential  equations  in  terms  of  the  slow-time  variable  that  describe  the 
evolution  of  the  amplitude  coefficients  A(r),  B( r)  in  solution  (9.29);  consequently, 
they  are  also  often  called  amplitude  equations. 

Solving  the  amplitude  equations  to  determine  A(r),  B(r)  subject  to  their  initial 
conditions  (9.30)  yields  the  MMTS  leading  order  approximation  of  the  solution 
(9.29),  as  illustrated  in  Fig.  9.4. 


9.5  Further  Directions 

Perturbation  methods  for  weakly  nonlinear  oscillators  have  been  developed  exten¬ 
sively  in  connection  with  many  applications  ranging  from  mechanical  oscillations 
and  electrical  systems  to  population  dynamics.  Some  more  detailed  introductory  pre¬ 
sentations  are  given  in  [48,  54,  56,  78],  and  some  advanced  treatments  are  given  in 
[11,  47,  58,  73,  77].  Besides  Poincare-Lindstedt  and  the  method  of  multiple  time- 
scales,  other  related  approaches  also  exist,  including  the  method  of  averaging  [77, 
92,  102]  and  near-identity  transformations  [58,  73].  The  engineering  approach  of 
harmonic  balance  is  related  to  these  methods. 

The  mathematical  theory  underpinning  the  solvability  conditions  coming  from 
the  Poincare-Lindstedt  and  method  of  multiple  time- scales  approaches  is  the  Fred¬ 
holm  alternative  theorem  [39,  93].  The  Fredholm  alternative  supplies  a  criterion 
for  the  existence  or  uniqueness  of  solutions  of  inhomogeneous  linear  problems  that 
applies  to  these  oscillator  equations  and  other  classes  of  perturbation  problems  (see 
Exercise  9.13). 
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9.1  Consider  the  problem  for  x(t)  with  s  — >  0, 

d2x 

— -  +  x  =  32  sx2,  x(0)=e~s,  x'(0)  =  2-\-s. 

dtz 

Obtain  the  first  two  terms  of  the  regular  expansion,  x(t)  =  xo(t)  +  sx\(t)  +  0(s2). 
Identify  the  homogeneous  solution,  resonant  response  and  non-resonant  response  in 
xi  (t). 

9.2  For  each  problem  use  the  Poincare-Lindstedt  expansion  with  the  two-term 
approximation  of  the  solution,  x(t)  ~  xo (0)  +  sx\(0)  with  0  =  (coo  +  £co\)t, 
for  £  — >  0  to  find  xo(0)  and  coo,  co\. 

(a)  Are  there  periodic  solutions  for  every  value  of  a  >  0  for 

d2x 

— —  +  4x  =  —  sx3,  v(0)  =  a,  vr(0)  =  0? 
dt z 

(b)  Find  the  value  for  a(>0)  that  yields  a  periodic  solution  of 


d2x  9  dx 

— —  +  9x  =  —s(x  —  1)-—,  x(0)  =  a , 

dtz  dt 


x'(0)  =  0. 


9.3  Apply  the  method  of  multiple  scales  with  r  =  st  to  the  van  der  Pol  oscillator 


d2x  9 

— —  +  9x  =  -six  - 
dtz 


dx 


(9.33) 


to  obtain  a  solution  in  the  form  x(t)  ~  Xo(t,  r)  =  A(r)  sin(3t)  +  B(x)  cos(3t). 

(a)  Determine  the  amplitude  equations  forA(r),  ^(r).  Determine  initial  conditions 
for  A,  B  in  terms  of  v(0),  v^O). 

(b)  Let  R(r)  =  \J A2  +  B2.  Determine  the  equation  for  dR/dr  =  f(R)  using  the 
amplitude  equations  from  part  (a).  Determine  the  equilibrium  values  for  R. 

9.4  Show  that  the  leading  order  MMTS  solution  for  weakly  nonlinear  oscillators 
can  be  written  in  the  polar  form 

Xo (t,  r)  =  R(r)  sinO^  +  (9.34) 

(a)  Relate  the  amplitude  R  and  phase  0  to  the  coefficients  A ,  B  introduced  in  (9.29). 
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(b)  Show  that  (9.32)  can  be  rewritten  as 


dR 

dr 


=  — i  kR, 


d& 

—  =  -iR 
dr 


3  d2 


and  solve  this  simpler  polar  system  with  appropriate  initial  conditions. 

9.5  Use  the  method  of  multiple  scales  to  investigate  the  near-resonant  behaviour  of 
the  damped,  driven  oscillator  for  x(t)  for  s  — >  0, 


d2x 


dx 


9  +  sfi - hi  +  sax  =  s  cos  (t  +  yst), 

dtz  dt 


with  given  parameters  a,  /3,  y.  Use  the  slow-timescale  r  =  st.  Note  the  presence  of 

r  in  the  forcing  term  on  the  right-hand  side.  (Hint:  write  the  forcing  as  cos(t  +  y  r)) 

(a)  Show  that  the  leading  order  solution  can  be  written  in  the  complex  form 

X0(t,  t)  =  C{x)eit  +  Ctz)e~it, 

where  z  =  x  —  iy  denotes  the  complex  conjugate  of  z  =  x  +  iy.  Relate  the 
complex- valued  function  C  to  the  real- valued  functions  A,  B  used  in  (9.29). 

(b)  Using  the  result  of  part  (a)  in  the  equation  for  X\  (t,  r),  find  the  two  solvability 
conditions.  Show  that  these  reduce  to  a  single  complex  equation  for  dC /dr . 

(c)  Entrainment  describes  a  solution  locking  onto  the  behaviour  entirely  set  by  a 
forcing  term,  leaving  no  direct  sign  of  the  natural  frequency  from  the  unforced 
problem  (i.e.  no  homogeneous  solution).  Setting  C(r)  =  Mel°  eiyT  in  your 
equation  from  (b),  obtain  an  equation  for  M,  the  real- valued  constant  amplitude 
of  the  entrained  solution,  with  0  being  a  (real- valued)  phase  constant.  Determine 
the  detuning  relation ,  y  =  y(M ),  relating  the  amplitude  to  the  frequency-offset 
from  resonance. 

9.6  Apply  the  method  of  multiple  scales  with  r  —  st  and  s  — >  0  to  the  problem 


d2x 
dt 2 


+  £ 


dx 

dt 


dx 

dt 


+  x  —  0, 


x(0)  =  0, 


x\0)  =  1. 


Using  the  polar  form  (9.34),  derive  and  solve  the  amplitude  equations  for  R(r)  and 
0(r)  to  obtain  the  leading  order  solution  x(t)  ~  Xo(t,  r). 

Hint:  You  will  need  to  calculate  some  terms  of  a  Fourier  series.  Write  the  series  in 
terms  of  the  variable  s  =  t  +  @  on  —ix  <  s  <  tv,  namely  f(s )  =  ar  sin^s)  + 
bk  cos(ks). 

9.7  In  Exercise  3.6,  the  equation  for  the  parametrically-driven  pendulum  was 
derived;  for  a  specific  choice  of  parameters,  it  can  be  written  as 

d2e 

dt 2 


+  4  sin  0  =  —  s  sin(4t)  sin  0. 
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Using  the  scaling  for  small-amplitude  oscillations,  0  =  8x  for  8  ->  0,  at  leading 
order  this  equation  yields  a  form  of  the  Mathieu  equation , 

d2x 

— 2  +  [  4  +  e  sin  (4/)]  X  =  0.  (9.35) 

Consider  solutions  of  the  Mathieu  equation  for  s  — >►  0: 

(a)  Use  the  method  of  multiple  scales  with  r  =  st  to  determine  the  amplitude 
equations  for  the  slowly  varying  coefficients  in  the  leading  order  solution. 

(b)  Solve  for  the  leading  order  solution  that  satisfies  the  initial  conditions 

x(0)  =  5,  *'(0)  =  6. 

9.8  Consider  the  equation  for  the  complex-valued  solution  x(t)  with  s  — >  0, 

dx  9 

- 1-  i 4x  =  s  cos(4 t)x 

dt 

Apply  the  method  of  multiple  time  scales  with  r  =  st  and  x(t)  ~  Xo(f,  r)  + 
sX\ (t,  r). 

(a)  Write  the  equations  for  Xo  and  X  \ . 

(b)  Write  the  general  solution  of  the  0(1)  equation. 

(c)  Determine  the  amplitude  equation  and  explain  the  condition  that  selects  this 
result. 

(d)  Determine  the  leading  order  solution  for  x(t)  that  satisfies  the  initial  condition 

v(0)  =  1  +  i. 


9.9  For  the  problem  with  e  — >  0, 


d2x 
dt 2 


+ 


X  =  £X 


2 


*(0)  =  1,  v/(0)=0, 


the  slow  timescale  for  MMTS  is  not  the  usual  one,  r  7 ^  st.  Attempt  a  regular 
expansion  v  ~  vo  +  sx\  +  £2X2  to  determine  the  slow  time  variable.  Determine  the 
leading  order  solution  using  the  Poincare-Lindstedt  method. 

9.10  For  s  — >  0,  show  that  there  is  a  large-amplitude  periodic  solution  of 


d2x  2 

- —  +  X  =  £X  +  £  COS  t 

dt2 


x'(0)  =  0. 


To  do  this,  let  x(t)  ~  e  ^X(t,  r)  with  r  =  sat  and  X  ~  Xo  +  sy X\  +  s2y X2  and 
select  a,  p,  y  >  0  to  ensure  that  Xq,  Xi,  X2  have  no  secular  growth  terms. 
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9.11  Delay  differential  equations  (DDE)  are  equations  that  involve  current  proper¬ 
ties  of  the  solution  coupled  to  the  solution  at  earlier  times  [35].  Consider  the  DDE 

d2x 

- 7T  x  H-  sx(t  —  2)  =  0. 

dtz 

(a)  This  is  a  linear  constant  coefficient  equation  and  can  be  solved  in  terms  of  trial 
solutions  of  the  form  x(t)  =  eXt .  Write  the  characteristic  equation  determining 
A  and  obtain  the  solutions  to  0(s)  for  s  — >  0.  Are  the  solutions  of  this  equation 
stable  or  unstable?  (see  Sect.  1.5) 

(b)  Apply  the  method  of  multiple  scales  to  obtain  the  amplitude  equations. 

(c)  Similarly  analyse  the  weakly -delayed  linear  oscillator  equation, 

d2x 

— — r — |-  x(t  —  s)  =0. 
dtz 


9.12  Now  we  complete  the  derivation  of  the  KdV  equation  that  was  started  in 
Exercise  8.5.  There,  we  obtained  the  equations  for  C(x ,  t)  =  Co  +  s2C\  +  •  •  •  and 
F(x,  t)  =  Fq  +  s2  F\  +  •  •  • ,  as 


0(1)  equations: 


9  Cp 
dt 


+  Co  =  0, 


3 Fq  82Cq 
dt  dx 2 


(9.36) 


0(s2)  equations: 


9Q  t  a3 c0  t  (dc0y 

dt  1  2  dtdx2  2  \  dx  ) 

m  d2Cx  =  i^Co  _F  d2Co_dFodCo 

dt  dx2  6  dx 4  °  dx2  dx  dx 


(9.37a) 

(9.37b) 


(a)  Show  from  the  0(1)  equations  that  Co  and  Fo  each  satisfy  the  classic  wave 
equation  (2.52)  with  unit  speed  and  having  independent  left/right  moving  waves. 

(b)  We  will  now  restrict  attention  just  to  right-moving  waves.  It  can  be  shown  that  the 
0(s2)  equations  would  lead  to  solutions  with  secular  growth,  hence  consider 
a  multiple-time  scale  expansion  of  the  form  C  =  co(z,  r)  +  s2c\(z,  r)  and 
F  =  fo(z,  r)  +  s2f\(z ,  r)  where  z  =  x  —  t  and  r  =  s2t.  Show  that  the  0(1) 
equations  reduce  to  a  single  relation  between  co ,  /o  and  that  the  O  (s2)  equations 
reduce  to  a  single  compatibility  condition  given  by  a  partial  differential  equation 
for  fo(z,  r),  the  KdV  equation, 


Vo 

dr 


(9.38) 


9.6  Exercises 


199 


9.13  The  Fredholm  alternative  theorem  is  the  key  principle  that  determines  the 
solvability  conditions  in  the  Poincare-Lindstedt  and  MMTS  methods,  but  here  we 
illustrate  how  it  also  applied  to  determining  the  eigenvalues  of  perturbed  matrices 

[All 

Consider  the  matrix 


A(S)  =  (3+'£-lV25Si„^ 

For  the  limit  £  ^  0,  use  the  following  steps  to  solve  the  eigenvalue  problem, 


Ax  =  Ax. 


(a)  Write  the  asymptotic  expansion  for  the  matrix  in  powers  of  £,  A  ~  Ao  +  eAi  + 
s2  A2  and  similar  expansions  for  each  eigenvector  and  corresponding  eigenvalue 
x  ~  xo  +  exi  +  £2X2,  and  A  ~  Ao  +  eAi  +  £2A2. 

(b)  The  leading  order  problem,  Aoxo  =  Aoxo,  can  be  written  in  the  form  of  a 
homogeneous  problem, 


(A0  -  A0I)x0  =  0 

Obtain  the  eigensolutions  for  this  matrix,  {Aq^,  Xq^}. 

(c)  For  this  problem,  every  vector  v  can  be  written  as  a  linear  combination  of  the 
leading  order  eigenvectors,  v  =  C(1)Xq^  +  C(2)Xq2).  Write  the  formulas  for 
fovk=  1 ,  2  in  terms  of  v  and  x^ . 

(d)  Write  the  0(e)  equation  in  the  form  of  an  inhomogeneous  problem  with  the 
same  matrix  operator  as  the  0(  1)  equation, 

(Ao  -  AoI)xi  =  fi(xo,  Ai,  Ai). 

Note  the  similarities  between  these  equations  for  xo,  xi  and  the  oscillator  prob¬ 
lems,  (9. 17a)  and  (9. 17b)  or  (9.28a)  and  (9.28b).  The  Fredholm  alternative  essen¬ 
tially  states  that 

The  solution  of  the  O  (■ en )  inhomogeneous  problem  will  exist  and  be  unique  if  the  forcing 
term  has  no  contribution  from  the  solution  of  the  0(s°)  homogeneous  problem.2 

For  oscillator  problems,  this  addresses  the  existence  of  periodic  solutions  in  the 
absence  of  resonant  forcing  terms. 


2This  is  the  simplified  version  for  symmetric  matrices  and  self-adjoint  differential  equations.  The 
general  version  of  the  Fredholm  alternative  is  similar: 

The  solution  of  the  non-homo geneous  problem  Aox  =  b  will  be  unique  if  and  only  if  the 
adjoint  problem  A^y  =  0  has  only  the  trivial  solution  [39,  93]. 
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For  matrix  equations,  non-existence  or  non-uniqueness  will  result  if  the  forcing 
term  includes  contributions  from  the  null  vector  corresponding  to  the  X^  in 
the  matrix  operator.  Select  the  values  of  X\  to  eliminate  those  contributions  and 
obtain  the  first  two  terms  in  the  expansions  of  the  eigenvalues,  A  ^  ~  Xq^  +sX^ 
for  k  =  1,2. 


Chapter  10 

Fast/slow  Dynamical  Systems 


In  previous  chapters  we  employed  perturbation  methods  to  study  problems  in  differ¬ 
ential  equations  where  the  solution  of  the  leading  order  problem  could  be  obtained 
in  a  simple  explicit  form.  For  problems  where  the  leading  order  solution  cannot  be 
obtained  in  a  simple  form,  a  different  approach  is  needed.  Here  we  show  that  ideas 
from  phase  plane  analysis  (Chap.  1)  can  be  used  to  circumvent  these  difficulties 
and  allow  us  to  apply  boundary  layer  methods  (Chap.  7)  to  separate  problems  into 
dynamics  occurring  at  different  scales. 

The  fundamental  principles  for  formulating  rate  equations  given  in  Chap.  1  are 
universally  applicable  for  constructing  models.  But  when  the  number  of  equations 
in  the  resulting  system  is  large  or  the  leading  order  problem  for  s  =  0  is  nonlinear, 
then  some  of  the  direct  approaches  that  we  have  seen  in  previous  chapters  become 
cumbersome  and  may  not  be  useful. 

Similarly,  Chap.  9  introduced  approaches  based  on  perturbation  methods  to  esti¬ 
mate  the  behaviour  of  some  models  yielding  oscillatory  behaviours.  But  those  meth¬ 
ods  were  limited  to  weakly  nonlinear  problems,  where  the  leading  order  solution 
satisfied  a  linear  oscillator  equation  and  could  be  calculated  explicitly  in  terms  of 
sines  and  cosines.  For  some  more  general  nonlinear  problems,  we  will  show  that 
properties  of  the  system  can  be  determined  without  the  need  for  a  closed-form  solu¬ 
tion. 

The  common  structure  for  problems  that  we  will  investigate  in  this  chapter  is  that 
of  singularly-perturbed  dynamical  systems.  In  Chap.  7,  we’ve  learned  that  singularly 
perturbed  differential  equations  exhibit  separation  of  spatial  scales,  having  rapid 
variations  in  boundary  layers  in  contrast  to  slowly- varying  outer  solutions.  Likewise 
in  Chap.  9,  a  separation  of  time-scales  occurred  between  the  oscillations  on  the  fast 
time  scale  and  evolution  of  the  amplitude  functions  on  slow  scales.  Extending  this 
approach  to  more  general  dynamical  systems,  we  seek  to  decompose  each  problem 
into  two  sub-systems: 

•  The  slow  system :  the  problem  in  terms  of  the  original  (“slow-time”)  variable, 
whose  solution  will  be  treated  analogously  to  the  outer  solution  in  a  boundary 
layer  problem. 
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•  The  fast  system:  a  rescaling  of  the  system  describing  rapid  evolution  occurring 
over  shorter  times,  analogous  to  the  inner  solution  of  a  boundary  layer  problem. 

The  solution  of  each  sub-system  will  be  sought  in  the  form  of  a  regular  perturbation 
expansion.  For  singularly-perturbed  problems,  the  sub-systems  will  have  simpler 
structures  than  the  full  problem  and  allow  for  the  slow-  and  fast-dynamics  to  poten¬ 
tially  be  characterised  in  terms  of  reduced  phase  line  or  phase  plane  dynamics. 

We  will  study  two  problems  that  could  be  considered  directly  in  terms  of  a  standard 
phase  plane  analysis,  but  through  the  use  of  perturbation  theory  (that  is  also  applicable 
to  higher  order  systems),  we  will  show  that  the  underlying  dynamics  can  be  separated 
into  more  convenient  forms. 


10.1  Strongly-Nonlinear  Oscillators:  The  van  der  Pol 
Equation 


For  oscillatory  systems  where  the  leading  problem  for  e  — >  0  does  not  yield  the 
harmonic  oscillator  equation,  Xq  (t)  +  vo  (t)  =  0,  the  approaches  from  Chap.  9  cannot 
be  applied  in  a  straightforward  manner.  As  an  example  of  a  “strongly  nonlinear ” 
oscillator,  we  will  show  how  to  construct  solutions  of  the  van  der  Pol  oscillator , 


d2x 

‘7?  +  (x 


dx 


+  v  =  0 


(10.1) 


This  equation  is  similar  to  the  weakly-nonlinear  damped  oscillator  (9.33),  but  here 
the  first-order  damping  term  is  part  of  the  leading  order  equation  while  the  second 
derivative  term  is  a  singular  perturbation. 1  Since  the  perturbation  parameter  here  mul¬ 
tiplies  the  highest  order  derivative,  this  type  of  problem  could  be  called  a  “singularly 
perturbed  oscillator”  but  the  more  common  name  is  a  relaxation  oscillator. 

We  begin  by  re-writing  (10.1)  in  terms  of  a  convenient  formulation  for  a  phase 
plane  analysis  but  will  use  a  different  choice  of  intermediate  variable  y(t)  than  the 
standard  velocity  (1.37).  Notice  that  the  first  two  terms  in  (10.1)  can  be  written  as  a 
total  derivative, 


d 

dt 


dx  i  o  \ 

£ - 1-  fX  —  V  I  +  V  =  0. 

dt  3  ) 


Defining  the  expression  in  parentheses  as  y  =  ex!  +  ^  v3  —x,  we  obtain  an  autonomous 
system  for  (. x(t ),  y(t )), 


dx  i  3 

£—-  =  y  +  v  -  , 

dt  5 


(10.2) 


Attempting  to  put  this  equation  in  the  form  (9.2)  fails  because  dividing  by  s  suggests  a  very  fast 
oscillation,  coq  =  e_1// 2  ->  oo,  and  a  large  (0(e-1)  ->  oo)  rather  than  small  perturbation  term. 
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which  is  called  the  Lienard  transformation  of  the  van  der  Pol  equation.  The 
singularly-perturbed  nature  of  this  system  is  clear:  for  e  =  0,  the  ODE  for  x(t) 
reduces  to  an  algebraic  relation. 

We  begin  by  determining  the  outer  (“slow”)  solution  using  regular  perturbation 
expansions  for  both  x,  y: 

x(t)  =  xo(t)  +  exi(t)  +  0(s2),  y(t )  =  yo(t)  +  eyi(t)  +  0(s2). 


Collecting  the  0(1)  terms  in  the  expansions  of  (10.2),  we  obtain  the  leading  order 
slow  system, 


0  =  Jo  +  *o  -  \xl, 


(10.3) 


The  differential  equation  for  yo(0  allows  us  to  qualitatively  describe  the  dynamics 
of  the  solution  in  terms  of  vertical  motion  in  the  phase  plane  as 


y0(0  is 


decreasing  for  vo  >  0, 
increasing  forvo  <  0; 


(10.4a) 


hence  the  yo-axis  is  the  y-nullcline  curve  (see  Sect.  1.5.1)  for  both  the  full  problem 
and  the  slow  system.  The  remaining  equation  in  (10.3)  restricts  the  slow-time  solution 
to  evolve  only  on  the  curve 


yo(*o)  =  -xo=  S(x o),  (10.4b) 

which  is  called  the  slow  manifold  (or  “slow  solution  curve”)  and  is  shown  in  Fig.  10. 1 . 

Restricting  the  solution  to  stay  on  the  slow  manifold  while  following  the  evolution 
described  by  (10.4a)  suggests  that  starting  from  any  vo  <  0  will  drive  the  solution 
upward  to  approach  the  local  maximum  of  the  curve  at  yo(—  1)  =  2/3,  while  the 
dynamics  in  the  right  half  plane,  with  vo  >  0  will  force  solutions  to  evolve  toward 
the  local  minimum  at  yo(l)  =  —2/3.  At  those  critical  points  (extrema),  the  vertical 
motion  still  follows  (10.4a)  even  though  there  is  no  obvious  place  to  go  on  the  slow 


Fig.  10.1  Arrows  indicating 
dynamics  of  the  leading 
order  solution  on  the  slow 
manifold  (10.4b)  and  at  the 
two  critical  points  of  the 
curve 
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manifold.  The  solution  therefore  gets  pushed  off  the  slow  manifold2;  everything  off 
the  slow  manifold  in  the  phase  plane  is  part  of  the  fast  problem ,  to  which  we  now 
turn  our  attention. 

Following  the  analysis  in  Chaps.  6  and  7,  in  order  to  find  singular  solutions,  we 
rescale  the  original  system  in  terms  of  new  variables.  To  determine  the  fast  variables, 
we  substitute  the  general  scalings 


T  =  t—f,  x  =  e^X(T),  y  =  SYY(T), 
£a 


(10.5) 


into  (10.2)  where  the  scaling  exponents  a,  /3,  y  and  the  time  /*  when  the  fast  dynamics 
occur  are  to  be  determined.  For  this  problem,  it  can  be  shown  that  x,y  are  always 
0(1),  so  we  take  ft  =  y  =  0  and  determine  the  value  for  <*(>0)  from  dominant 
balance  in 


,1—a 


dX 

dT 


=  Y  +X-\X3 


dY 

—  =  -X. 
dT 


In  applying  the  method  of  dominant  balance  to  systems  of  equations,  distinguished 
limits  will  typically  yield  a  balance  in  one  equation  at  a  time,  while  the  other  equa¬ 
tions  yield  sub-dominant  contributions,  where  the  rate  of  change  will  be  o(l).  The 
distinguished  limit  a  =  0  balances  the  terms  in  the  second  equation  and  reproduces 
the  original  slow  system  we  have  already  considered,  (10.2).  The  choice  a  =  1 
balances  all  of  the  terms  in  the  first  equation, 


dX 

dT 


=  Y+X-  ±X3, 


(10.6) 


where  we  have  multiplied  the  second  equation  by  sa . 
Seeking  the  fast  solutions  as  regular  expansions, 


X(T)  =  X0(T)  +  £X|  (T)  +  0(s2),  Y(T )  =  Y0(T)  +  sYi(T)  +  0(e2), 


yields  the  leading  order  fast  system, 


dX o 
~dT 


=  Y0+X 0-  ±X03, 


(10.7) 


The  second  equation  shows  that  the  dynamics  in  the  fast  system  (everywhere  in 
the  phase  plane  away  from  the  slow  manifold)  must  have  To(^)  =  constant,  i.e. 
only  horizontal  motion  is  admissible.  With  this  in  mind,  the  first  equation  can  be 
interpreted  as  describing  motion  on  a  phase-line  for  Xq(T), 


dX o 

—±  =  Yo-S(X o),  (10.8) 

dT 


'At  which  point,  the  solution  is  no  longer  described  by  the  dynamics  of  the  slow  system. 
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where  the  value  of  To  selects  a  “slice”  through  the  slow-manifold  analogous  to 
(1.29)  from  Chap.  1.  The  equilibrium  points  of  (10.8)  are  given  by  the  solutions  of 
S(X o)  =  To  with  their  stability  being  determined  by  the  sign  of  —S'(X o),  while  all 
other  values  for  Xq  give  monotonic  increasing  or  decreasing  solutions. 

In  summary,  for  the  slow  system,  we  have  identified  the  slow  manifold  and 
described  how  the  dynamics  on  the  slow  manifold  approach  the  critical  points.  Tak¬ 
ing  those  as  departure  points  into  the  fast  system,  the  subsequent  motion  is  given  by 
the  rapid  evolution  of  v  on  horizontal  lines  that  terminate  at  points  of  intersection 
with  the  slow  curve.  This  qualitative  description  allows  us  to  sketch  the  form  of  tra¬ 
jectories  starting  from  any  point  in  the  (x,  y)  phase  plane  and  identify  the  four- stage 
structure  (and  global  stability)  of  the  stable  finite-amplitude  limit  cycle  solution,  see 
Figs.  10.2  and  10.3.  The  general  features  of  the  limit  cycle  (maximum,  minimum  val¬ 
ues  and  period)  can  be  obtained  from  these  results  (see  Exercise  10.1).  The  fast-time 
dynamics  can  be  asymptotically  matched  to  outer  solutions  (slow  solutions),  and  so 
the  fast-dynamics  act  as  “interior”  boundary  layers  (in  the  terminology  of  Chap.  7). 


Fig.  10.2  Starting  from  general  initial  conditions  not  on  the  slow  manifold,  there  will  first  be  a 
rapid  horizontal  transition  or  “jump”  7  to  a  stable  branch  of  the  slow  manifold  followed  by  slow 
evolution  down  that  branch  2  to  the  equilibrium  point.  Dropping  off  the  minimum  yields  another 
fast  jump  3  across  to  the  other  stable  branch  of  the  slow  manifold,  where  slow  dynamics  drive  the 
solution  upwards  4  to  the  maximum  and  on  to  another  fast  jump  back  across  to  the  previous  stable 
branch  of  the  slow  manifold.  The  resulting  (periodic)  dynamics  correspond  to  an  attracting  limit 
cycle 


Fig.  10.3  Sketch  of  the 
time-profile  illustrating  the 
four  stages  in  the  van  der  Pol 
limit  cycle 


t 
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10.2  Complex  Chemical  Reactions:  The  Michaelis-Menten 
Model 

The  reactions  described  in  Sect.  1.2  can  be  said  to  be  “what  you  see  is  what  you 
get”  elementary  reactions  if  the  rate  equations  obtained  from  direct  applications  of 
the  the  law  of  mass  action  predicts  the  rates  as  would  be  observed  in  experimental 
studies  of  the  systems.  However,  for  general  reactions,  collisions  of  three  or  more 
molecules  almost  never  happen  exactly  simultaneously  (only  single-molecule  decay 
or  binary  collisions  typically  occur  as  elementary  steps).  The  term  non- elementary 
reactions  describes  the  overall  relation  between  reactants  and  products,  but  with  the 
molecular  collisions  and  mass  action  kinetics  involved  actually  occurring  in  many 
“hidden”  intermediate  sub-stages.  For  example,  a  non-elementary  version  of  (1.14a) 
could  be 

nA  +  mB ->  (???)  ->  (???)  ->  (???)  ->  pC  +  qD. 

The  temporary  products  from  intermediate  reactions  are  called  complexes  and  are 
often  unstable  compounds  that  exist  only  briefly. 

To  obtain  the  overall  rate  of  creation  of  products,  we  need  to  analyse  the  full  system 
with  all  intermediate  stages  expanded  out  as  elementary  reactions.  Part  of  the  very 
difficult  work  of  chemists  and  bio-chemists  is  to  determine  all  of  the  intermediate 
reactions.  If  all  of  the  intermediate  reactions  are  known,  then  we  may  have  a  very 
complicated  system  of  rate  ODEs.  Our  goal  is  to  try  to  condense  the  system  down  to 
just  the  overall  reaction 

reactants  products 

and  to  determine  the  effective  rate  equation  for  the  products.  The  theory  of  dynamical 
systems  can  be  applied  to  accomplish  this  for  many  chemical  reaction  systems. 
Given  a  complete  system  of  reactions  and  initial  conditions,  we  illustrate  through  an 
example  how  to  obtain  a  simple  model  for  the  rate  equations  of  the  products. 

A  simple  version  of  an  enzyme-mediated  chemical  reaction  process  is  given  by 
the  system, 

S  +  E  A  c  X  P  +E,  (10.9) 

ki 


where  S  is  the  “substrate”  reactant  and  P  is  the  concentration  of  the  desired  product. 
An  enzyme  (or  catalyst)  is  a  compound  whose  special  property  is  that  it  allows  for 
intermediate  reaction  steps  that  lead  to  the  overall  reaction,  in  this  case 

S^P,  (10.10) 

or  allow  it  proceed  more  rapidly  than  without  the  enzyme;  in  some  cases  this  direct 
transformation  might  not  be  possible  at  all  without  the  enzyme. 

Let  E  be  the  concentration  of  the  enzyme,  and  C  is  the  intermediate  “complex”  of 
SE  bound  together.  The  enzyme  is  typically  a  complicated  and  expensive  compound, 
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but  notably  it  is  not  actually  consumed  by  (10.9).  Consequently,  the  enzyme  may  be 
given  in  a  small  initial  concentration  Eo.  The  complex  C  is  an  unstable  temporary 
state  and  none  would  be  present  before  the  reaction  starts.  We  will  assume  that  the 
production  process  begins  with  no  product,  and  a  large  initial  supply  of  reactant,  So, 
all  to  be  converted  into  P.  Assuming  the  reaction  rate  constants  /q ,  k2,  k 3  are  known 
finite  values,  our  goal  is  to  determine  the  rate  of  production  for  P  and  how  it  can  be 
represented  in  simplest  form. 

Beginning  by  expanding  (10.9)  into  its  full  set  of  elementary  reactions, 

S  +  eXc  S  +  eEc  cEp  +  E,  (10.11) 


the  law  of  mass  action  yields  the  rate  equations  for  consumption/production  of  each 
chemical  species  in  the  dimensional  form 


d  P 

~dT 

dS 

dT 


k]  SE  +  k2C, 


dC 

dT 

dE 

dT 


=  kiSE-k2C-k3C, 

=  —  kjSE  +  k2C  +  k3C, 


(10.12a) 


subject  to  initial  conditions  at  t  =  0, 


S(0)  =  So,  E(0)  =  E0,  C(0)  =  0,  P(0)  =  0. 


(10.12b) 


We  now  nondimensionalize  system  (10.12a,  10.12b).  Given  the  one-to-one  corre¬ 
spondence  between  S  and  P  and  between  E  and  C  as  reactants/products  in  (10.11), 
we  scale  these  pairs  by  the  respective  initial  concentrations 

S(T)  =  S  P(T)  =  So p(t),  (10.13) 

E(T)  =  E0e(f),  C(T)  =  E0c(0,  T  =  Tt. 


We  will  go  on  to  select  the  timescale  T  later  below.  Through  these  scalings,  the 
nondimensional  concentrations  are  all  normalised  such  that 


0  <  e(t),  c(t)}  <  1. 

Our  previous  intuition  related  to  the  transformation  of  enzyme  to  complex  molecules, 
E  ^  C,  can  be  made  concrete  by  noting  that  the  total  amount  of  the  two  substances 
is  conserved  for  all  times,  following  directly  from  (10.12a), 

—  (C  +  E)  =  0  C(T)  +  E(T)  =  Eo, 
dl 


so  that  c  +  e  =  1 .  This  allows  us  to  use  e  =  1  —  c  to  eliminate  the  enzyme  from  the 
system,  leaving  the  problem  in  terms  of  s,  c,p. 
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We  take  the  initial  amount  of  substrate  to  be  finite,  So  =  0(1),  and  assume  that 
we  begin  with  a  relatively  small  amount  of  enzyme,  Eo  So.  We  now  select  the 
timescale  T  in  order  to  make  the  consumption  rate  ds/dt  =  0(1).  This  choice  yields 
T  =  1  /(ki Eo).  The  resulting  scaled  system  is  therefore  given  by 

s(  0)  =  1,  (10.14) 

c(0)  =  0, 

P(  0)  =  0, 

where  we  have  relabelled  dimensionless  combinations  of  parameters  as 

Eo  k2  k2  +  k2 

s  =  ^L«l  k=*=0(  1)  /x=^— -  =  0(1).  (10.15) 

bo  Kibo  Kibo 

We  will  consider  the  limit  of  a  very  small  amount  of  enzyme  in  the  system  (s  0) 
for  our  perturbation  analysis.  Noting  that  the  equations  for  s,  c  are  independent  of 
p ,  we  first  solve  for  the  s,  c  system  and  then  use  the  results  to  determine  p  from  the 
final  equation  in  (10.14). 

Since  (10.14)  is  a  singularly  perturbed  system  for  s  — >  0,  we  expect  a  separation 
between  fast  and  slow  time  scales.  Beginning  with  the  slow  system,  we  consider  reg¬ 
ular  expansions  for  the  solutions,  s  =  so(t)  +  0(s ),  c  =  co(t)  +  0(e).  Substituting 
these  into  (10.14)  yields  the  leading  order  slow  system, 


ds 

dt 

dc 

dt 

dp 

dt 


=  —s(  1  —  c)  +  Ac, 


=  s(l  —  c)  —  /UiC , 


=  (li-  A)c, 


dso 

dt 


-so(l  -  c0)  +  Ac0,  0  =  soil  ~  cq)  -  /ICQ, 


(10.16) 


with  the  second  equation  giving  an  algebraic  relationship  between  the  complex  and 
substrate  concentrations, 

a  co 

so(co)  =  (10.17) 

1  -  co 

this  is  the  slow  manifold  for  this  problem. 

We  note  that  our  initial  conditions,  c(t  =  0)  =  0,  s(t  =  0)  =  1  do  not  lie  on  the 
slow  manifold  and  hence  there  must  be  a  brief  initial  layer  at  /*  =  0,  governed  by  the 
dynamics  of  the  fast  system,  that  describes  the  transition  from  the  initial  conditions  to 
the  slow  manifold.  To  determine  the  form  of  the  fast  system,  we  rescale  the  variables 
as 

s  =  s(T),  c  =  c(T ),  r  = 


yielding 


£ 


—a 


dS 


1  —a 


dC 


dT 


5(1-C)  + AC,  e 


dT 


=  5(1  -  C)  -  iiC. 
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The  distinguished  limit  for  the  fast  system  is  found  to  occur  when  a  =  1  and  the 
leading  order  fast  system  is  then  given  by 

dS()  dCo 

=  0,  ~^  =  So(l  ~  Co)  -  mC0,  (10.18) 

with  initial  conditions  So(0)  =  1  and  Co(0)  =  0.  Consequently  there  is  no  change 
to  the  substrate  concentration  in  the  initial  layer  at  leading  order,  and  it  remains  at 
its  initial  value,  So(T)  =  1.  For  completeness,  we  could  determine  Co(T)  from  this 
system,  but  bearing  in  mind  that  our  final  goal  is  to  determine  p(t),  we  only  actually 
need  to  find  s(t)  as  we  can  replace  c{t)  in  terms  of  s(t)  (at  leading  order)  in  (10.14) 
using  the  slow  manifold  (10.17).  Asymptotic  matching  of  the  fast  and  slow  solutions 
for  s  yields  the  initial  conditions  for  the  slow  solution, 


lim  So(T)  =  1  =  lim^o(0- 

T— >oo  r— >0 


Hence,  while  the  initial  layer  has  a  dramatic  effect  on  the  concentration  of  the  com¬ 
plex,  it  essentially  leaves  the  initial  condition  on  s  from  (10.14)  unchanged.  The 
initial  layer  is  effectively  a  boundary  layer  with  respect  to  time  for  the  initial  value 
problem  for  c  in  (10.14).  Figure  10.4  shows  a  comparison  of  the  numerical  solution 
of  (10.14)  against  the  leading  order  fast/slow  dynamics  (the  slow  manifold  being 
described  by  (10.17)). 

Substituting  (10.17)  into  (10.14),  we  get  the  slow  system, 


dso 

dt 


li  —  k 

- - - s0, 

ll  +  so 


dpp 

dt 


fi  —  X 

- - - so, 

11  +  s  0 


(10.19) 


and  we  can  finally  confirm  our  expectation  that  the  total  of  the  substrate  and  product 
is  conserved,  d(so  +  po)/dt  =  0.  Using  the  initial  conditions,  we  have  effectively 
reduced  the  original  problem  to  solving  a  single  first  order  ODE  for  so(t),  with po(t) 
then  being  given  by  po(t)  =  1  —  so(t).  This  effective  nonlinear  rate  law  is  called 
the  Michaelis-Menten  law  and  is  used  extensively  in  the  modelling  of  biochemical 
systems. 


Fig.  10.4  The  cs  phase 

plane  showing  the  slow 

manifold  curve  (10.17)  and  a 

numerical  solution  of  the  full 

system  (10.14)  for  e  =  1/5 

starting  from  initial  «o 

condition  (c,  s)  =  (0,  1) 


c 
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10.3  Further  Directions 

More  extensive  presentations  of  dynamical  systems  are  given  in  many  texts,  both 
from  more  theoretical  [43,  70],  and  more  applied  standpoints  [54,  94].  Many  books 
illustrate  in  detail  the  use  of  perturbation  methods  for  studying  dynamical  systems 
[47,  54,  56,  73,  102],  with  the  van  der  Pol  oscillator  being  a  classic  example.  Mod¬ 
elling  and  applications  of  chemical  kinetic  systems  are  presented  more  thoroughly 
in  [37,  49,  57,  74].  Some  alternative  limiting  cases  of  the  enzyme  kinetics  system 
are  explored  in  [87,  90]. 


10.4  Exercises 


10.1  Consider  the  van  der  Pol  equation  for  x(t)  with  0  <  s  <<C  1, 


d2x 

£~dfl 


+  (3x2  —  6x  —  9) 


dx 

dt 


H-  4x  =  0. 


(a)  Determine  f(x)  so  that  this  equation  can  be  written  as  a  Lienard  phase  plane 
system  in  the  form 


dx 

s—  =f(x)+4y, 
dt 


dy 

dt 


=  —x. 


(b)  For  fixed  s  >  0,  find  the  equilibrium  point(s)  in  the  phase  plane,  find  their 
eigenvalues,  and  classify  their  linear  stability. 

(c)  Use  the  expansions  x(t)  =  xo(t)+sxi(t)  +  0(s2),  y(t)  =  yo(t)+syi(t)  +  0(s2), 
to  determine  the  equations  for  the  leading  order  slow  solution.  Sketch  the  slow 
manifold,  indicate  the  direction  of  motion  on  each  part,  and  identify  the  two 
attracting  points  on  the  curve. 

(d)  Use  the  expansions  x(t)  =  Xo(T)  +  eX \  (T)  +  0(s2),  y(t)  =  Yq(T)  +  sY\  (T)  + 
0(s 2)  with  T  =  t/s  to  obtain  the  equations  for  the  leading  order  fast  solution. 

(e)  Use  the  phase  plane  to  determine  the  maximum  and  minimum  values  of  x(t) 
during  an  oscillation,  see  Fig.  10.2.  Sketchx(t)  as  a  function  of  time,  see  Fig.  10.3. 

(f)  Using  the  time  required  for  the  slow  motions  in  (c)  (neglecting  the  short  times 
for  the  fast  solutions  (d)),  determine  the  leading  order  estimate  for  the  period  P 
of  oscillation  of  the  limit  cycle. 

Hint:  Find  the  time  spent  moving  along  each  of  the  slow  curves  by  obtaining  an 
equation  dx$/dt  =  g(vo)  from  (c)  and  then  separate  variables  to  write 


dx  o 


=  g(xo) 


‘tend 

=>  P  —  I  dt  = 

4tart 


’JCend  dx 


g(x) 


dt 


-^start 
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and  finally  integrate  over  the  ranges  in  v  that  are  appropriate  to  each  of  the  two 
slow  segments  and  adding  together  those  two  times. 


10.2  We  now  consider  different  limits  for  a  dynamical  system  describing  a  chemical 
reaction  problem  in  terms  of  three  variables,  see  (4.50).  Consider  the  limit  s  0 
for  each  of  the  following  cases, 

(a)  For  the  system 


dx 

dt 

dy 

dt 

dz 
£ — 
dt 


=  2  -y, 

VO) 

=  x  -  z, 

v(0) 

?  1  s 

=  y-y  +3 y  -z. 

VO) 

1, 

3, 

0. 


(10.20) 


Identify  the  surface  z  =  S(x,  y )  that  defines  the  slow  manifold.  Find  the  equi¬ 
librium  point  of  the  leading  order  slow  phase  plane  system  and  show  that  it  is 
asymptotically  stable  for  t  — >  oo.  Also  determine  the  form  of  the  initial  layer 
that  describes  the  transition  from  the  initial  conditions  to  the  slow  manifold. 

(b)  For  the  system 


dx 

dt 

dy 
£  — 
dt 

dz 

dt 


=  2-y, 

VO) 

=  x  —  z, 

VO) 

?  1  s 

=  y-y  +  p  -  z. 

VO) 

0, 

3, 

1. 


(10.21) 


Show  that  the  slow  manifold  reduces  to  a  curve  that  could  be  written  in  parametric 
form  as  v  =  x(z),y  =  y(z),z  =  z.  Determine  the  asymptotic  solution  for 
t  ->  oo.  Also  determine  the  form  of  the  initial  layer  that  describes  the  transition 
from  the  initial  conditions  to  the  slow  manifold. 


10.3  Consider  the  problem  of  forming  a  “tri-mer”  (a  three  segment  polymer  mole¬ 
cule)  from  three  mono-mer  molecules, 


3A 


This  is  an  example  of  polymerisation.  It  is  a  non-elementary  reaction  and  takes  place 
via  intermediate  stages.  Call  the  tri-mer  “C”  and  the  di-mer  (A2)  “ B ”.  Suppose  that 
the  full  reaction  mechanism  is  given  by 
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(a)  Write  the  dimensional  rate  equations. 

(b)  Nondimensionalize  using  the  scalings 

A(T)  =  A0  a(t)  B(T)  =  B  0b(t)  C(T)  =  A  0c(t)  T  =  Tt 

where  Ao  is  the  initial  concentration. 

(c)  Let  T  =  l/(kiAo)  and  assume  that  s  =  Bo/Ao  — >  0. 

Identify  the  other  dimensionless  parameters  (call  them  TJ\,  TJ2)  and  state  what 
asymptotic  relations  on  the  rate  constants  kn  must  hold  if  we  assume  that  only 
the  sdb/dt  terms  vanishes  from  the  leading  order  slow  system. 

Hint:  Normalise  the  coefficient  of  the  -\-a 2  term. 

(d)  Find  the  leading  order  slow  manifold  and  write  the  dimensional  equations  for 
the  long-term  rate  of  production,  and  hence  find  G(A)  and  F(A)  in 


dA 

dJ 


=  -G(A), 


dC 

dJ 


=  F(A). 


10.4  Consider  the  system  of  chemical  reactions 


A  +  X 


h 


A  +  Y 


h 


2X 


2  Y 


P 


where  the  concentration  of  A  is  kept  constant  and  ki ,  ,  ^3 ,  k\  are  given. 

(a)  Write  the  rate  equations  for  x(t )  and  y(t). 

(b)  Nondimensionalize  using  T  =  Tt,  X  =  Xx,  Y  =  Y y.  Let  T  =  1  /(ki A) . 
Determine  X  and  Y  so  that:  (1)  all  of  the  terms  in  the  dx/dt  equation  and  (2)  the 
y2  term  in  the  dy/dt  equation  are  normalised. 

(c)  Determine  the  remaining  independent  dimensionless  parameters  (call  them  a ,  /3 ) 
and  write  the  nondimensionalized  equations. 

(d)  In  terms  of  a,  /3,  determine  the  concentrations  x,  y  for  the  positive  equilibrium 
solution. 

10.5  In  chemistry,  a  widely-used  short-cut  avoiding  the  full  scaling  and  slow/fast 
perturbation  analysis  is  to  jump  to  the  leading  order  slow  equations  using  the  assump¬ 
tion  that  the  rate  of  production  of  all  intermediates  equilibrate  (i.e.  reduce  to  steady 
algebraic  relations,  like  s  ^  =0);  this  is  called  the  Quasi-Steady-State  Assumption 
(QSSA). 

Use  the  QSSA  approach  to  consider  the  overall  reaction  for  the  formation  of 
hydrogen-bromide:  H2  +  Br2  — >  2HBr.  The  reaction  takes  place  through  several 
steps: 


k\ 

Bv2  ^  2Br 

h 


Br  +  H2  H  +  HBr 

k\ 


H  +  Br2 


HBr  +  Br 


Consider  the  atomic  forms  H  and  Br  to  be  unstable  intermediates  (similar  to  com¬ 
plexes  being  unstable  intermediates).  Apply  the  QSSA  to  obtain  the  dimensional 
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rate  law  for  the  production  of  HBr  (written  here  using  the  chemistry-notation  of  [X] 
being  the  concentration  of  chemical  X)  [6], 

d[HBr]  _  a[H2][Br2]3/2 
dl  ~  | Bi2  I  +  /j[HBr| 

Find  a,  fi.  Hint:  Write  A  =  [H2],  B  =  [Br2],  C  =  [Br],  D  =  [H],  P  =  [HBr]  for 
doing  your  algebra. 


Chapter  11 

Reduced  Models  for  PDE  Problems 


In  previous  chapters,  we  have  considered  methods  for  solving  problems  in  partial 
differential  equations:  the  method  of  characteristics  in  Chap.  2,  similarity  solutions 
in  Chap.  5,  and  Fourier  series  in  Chap.  8.  Now  we  pursue  another  approach  that  can 
be  applied  to  problems  where  we  want  to  obtain  certain  properties  of  the  solution, 
but  do  not  need  an  explicit  representation  of  the  entire  solution.  We  will  see  that  in 
some  cases,  this  leads  to  substantially  shorter  calculations. 

The  approach  of  reduced  models  is  to  reformulate  a  problem  into  a  simpler  form 
that  preserves  the  properties  of  interest,  but  decouples  them  from  the  calculation  of 
other  details  of  the  solution’s  behaviour.  We  will  describe  the  method  of  moments, 
which  produces  reduced  models  that  can  describe  the  evolution  of  some  quantities 
for  the  solution  averaged  over  the  domain. 1 


11.1  The  Method  of  Moments 

For  a  solution  p(x,t )  of  a  PDE  problem  on  a  domain  D ,  we  can  define  the  moment 
integrals, 

Mn(t)  =  /  xnp(x,t)dx  n  —  0,1,2,...  (11.1) 

JD 

If  p  represents  a  density,  then  the  n  =  0  integral  corresponds  to  the  total  mass.  For 
some  problems,  we  can  determine  a  simple  set  of  equations  governing  the  evolution 
of  the  moment  integrals  (without  explicitly  finding  the  solution  p(x,  t))  and  then  use 
those  results  to  infer  information  about  the  behaviour  of  the  solution  to  the  problem. 


1  We  have  already  encountered  reduced  dimension  models  in  Chap.  8,  where  we  developed  a  method¬ 
ology  for  solving  PDEs  on  slender  two-dimensional  domains  by  replacing  them  with  ODE  problems 
for  an  asymptotic  solution  applying  on  most  of  the  domain. 
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We  illustrate  this  approach  for  the  heat  equation  on  —  oo  <  v  <  oo 


dp  d2p 
dt  dx 2  ’ 


(11.2a) 


with  far- field  boundary  conditions  u  ->  0  as  \x\  — >  oo  and  initial  condition 


p(x,  t  =  0)  =  /(*), 


(11.2b) 


where  /  — >  0  rapidly  as  \x\  — >  oo. 

Consider  the  zeroth  moment  (mass)  integral, 


•oo 


Mo(t)  =  /  p(x,t)dx. 


-oo 


In  order  to  obtain  an  equation  to  describe  the  evolution  of  Mo(t ),  we  calculate  its 
rate  of  change  with  respect  to  time 


dM() 

dt 


dt  J — oo 


where  we  have  interchanged  the  order  of  differentiation  (with  respect  to  time)  and 
integration  (over  space).  Using  the  PDE  (11.2a)  to  replace  the  time  derivative  with 
the  second  order  spatial  derivative  and  then  integrating  yields 


dM{) 

dt 


oo 

=  0-0  =  0, 

— oo 


where  we  have  used  that  px  ->  0  as  | x  |  — >  oo  for  smooth  p  — >  constant  (specifically 
p  — >  0)  in  evaluating  the  boundary  terms.  Consequently,  we  have 

dM() 

-71  =°, 

dt 

so  that  the  total  mass  is  constant  in  the  problem  and  its  value  for  problem  (11.2a, 
11.2b)  is  determined  by  its  initial  value  at  t  =  0, 


oo 


M0(0)  = 


fix)  dx. 


(11.3) 
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We  can  proceed  similarly  for  the  first  moment,  M\  ( t )  =  J ^  xp  dx ,  for  which  we 
find 


dM  i 


/  * - 

J  —  DC  dt 


dx 


dx. 


Making  use  of  integration  by  parts  and  employing  further  assumptions  about  the 
decay  of  the  solution  as  \x\  — >  oo  being  sufficiently  rapid  to  eliminate  boundary 
terms  from  integration  by  parts,  we  obtain 


dM  i  dp 

-  =  x  — 

dt  dx 


dp 

—  dx  =  0. 
dx 


Therefore,  M\  is  also  a  constant,  also  set  by  the  initial  conditions  as 


Mi(0)  = 


•oo 


xf(x)  dx 


-DC 


(11.4) 


Having  these  moments  allow  us  to  evaluate  the  centre  of  mass  (the  average  value 
of  position  weighted  by  the  density),  defined  by  the  ratio  of  the  first  moment  to  the 
mass, 


f  xf  dx 
f  f  dx  ' 


(11.5) 


Investigating  the  evolution  of  v  can  provide  an  understanding  of  whether  the  solution 
is  generally  moving  in  some  direction  or  remaining  in  a  fixed  location;  in  this  case, 
v  remains  fixed  at  its  initial  position  as  both  Mo  and  M\  are  constants. 

Proceeding  to  the  second  moment,  we  find 


where  the  boundary  conditions  have  been  used  to  successively  eliminate  the  boundary 
terms  from  the  two  applications  of  integration  by  parts.  Consequently,  the  rate  of 
change  of  the  second  moment  is  given  in  terms  of  the  mass  Mo, 
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dM2 

dt 


=  2  M0. 


(H-6) 


For  the  heat  equation,  we  have  already  shown  that  the  mass  is  constant  and  so  the 
second  moment  increases  at  a  constant  rate.  We  can  therefore  express  M2  in  terms 
of  the  initial  conditions  as 


M2(t)=2t 


f  dx  + 


■oo 


x2  f  dx, 


—  CO 


(11.7) 


The  variance  is  defined  as  the  second  moment  of  p  with  respect  to  the  centre  of  mass 
and  yields  a  fundamental  measure  of  the  spreading  of  the  solution 


■CO 


V (t)  =  I  (x  —  x)2 p  dx 


-CO 

■co  roc 

.2 


x  p  dx  —  2x  /  xpdx-hx 


-2 


■co 


p  dx 


-00 


-CO 


-co 


=  M2  —  2xM\  +  x2Mq 

m}  M } 

=  M2-2— L  +  ~±Mo 

Mo  Mq 
Mr 

=  M2--±. 

M0 

The  variance  is  also  used  to  define  the  standard  deviation  in  terms  of  its  square-root, 
cr  =  VTVT,  yielding 


°2(t)  = 


M2- 


M2 

Mo 


(11.8) 


While  the  terminology  used  above  comes  from  probability  and  statistics,  the  defin¬ 
itions  are  direct  analogues  of  the  formulas  for  the  moment  of  inertia  (V  — >  I)  and 
radius  of  gyration  (cr  — >  R)  used  in  mechanics  for  describing  the  motion  of  objects 
[40,  67]. 

A  somewhat  more  direct  approach  for  constructing  moment  equations  is  to  directly 
the  integrate  the  product  of  the  weight  function  with  the  PDE  over  the  domain, 


jc^(PDE)  dx, 


D 
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Consider  for  example  the  following  problem  for  the  wave  equation  on  the  finite 
domain,  0  <  v  <  1 , 


(11.9a) 


subject  to  boundary  conditions 


dp 

dx 


x=0 


dp 

dx 


x — 1 


(11.9b) 


and  initial  conditions 


p(x,  0)  =  1 


cos(37rv). 


(11.9c) 


Integrating  (11.9a)  over  the  domain  and  the  applying  boundary  conditions  yields  an 
ODE  for  the  zeroth  moment, 


(ptt  =  Pxx)  dx 


Integrating  the  initial  conditions  for  p  generates  initial  conditions  for  Mo,  Mo(0)  =  1 
and  Mq( 0)  =  0  and  consequently  yields  Mo(t)  =  t4  +  t  +  2  —  ef . 

In  general,  the  method  of  moments  is  considered  to  be  successful  when  it  produces 
a  finite,  closed  set  of  ODEs  that  can  used  to  describe  selected  solution  properties.  If 
the  moment  equations  depend  on  an  indefinite  number  of  further  moments  or  other 
properties  of  p,  then  the  system  is  not  closed.  In  such  cases,  the  system  may  be 
approximated  through  the  use  of  problem- specific  closure  relations ,  as  used  in  the 
modelling  of  turbulent  fluid  flows.  While  this  method  has  been  described  here  as 
an  analytical  approach,  similar  analysis  is  used  to  reduce  PDE  problems  to  simpler 
systems  of  equations  that  can  be  evaluated  computationally  in  numerical  methods 
such  as  Galerkin  projection  methods. 

We  now  go  on  to  consider  two  classic  applied  mathematics  problems  that  have 
different  relations  to  the  method  of  moments. 


11.2  Hiring  Instability  and  Pattern  Formation 

In  this  section,  we  present  an  example  of  how  to  analyse  the  development  of  patterns 
in  PDE  models,  as  occur  in  important  systems  in  mathematical  biology  and  other 
applications.  The  problem  considered  will  also  show  that  predictions  from  reduced 
models  can  sometimes  be  misleading  and  must  be  considered  with  caution. 
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We  have  observed  that  solutions  of  the  heat  equation  will  generically  exhibit  dif¬ 
fusive  spreading  (as  implied  by  a  (t)  increasing  with  time  from  (1 1.8)  and  similarity 
solutions  having  v  =  0(^/t)  from  Sect.  5.5).  Consider  the  problem  for  the  heat 
equation  on  a  finite  domain  with  no-flux  boundary  conditions, 


dp  d2  p 
dt  dx2 ’ 


0  <  x  <  7 r, 


dp 

dx 


=  0, 

x=0 


dp 

dx 


=  0, 

X=7 T 


with  a  given  initial  condition 


p(x,  0)  =  f(x). 

The  mass  of  the  solution  will  be  conserved  for  all  times.  Further,  it  can  be  shown  that 
for  t  — >  oo,  the  solution  will  approach  the  average  value  set  by  the  initial  condition 

-  1  fn 

p(v,  t  — >  oo)  ->  /  where  f  =  —  f(x)dx. 

n  Jo 

Similarly,  adding  diffusive  effects  to  a  transport  equation  can  generally  be  expected 
to  cause  the  solution  to  both  spread  and  smooth  out.  Likewise,  if  diffusion  is  incor¬ 
porated  into  a  model  for  chemical  reaction  kinetics  to  yield  a  partial  differential 
equation,  as  in 

dc  dc  d2c 

—  —  —c  =>-  —  =  — 7T  —  c, 

dt  dt  dx2 

then  it  can  be  shown  that  the  spatial  variation  in  the  initial  condition  c(x,  0)  will 
eventually  level-out  as  the  solution  approaches  the  solution  of  the  original  ODE, 
c(x,t)  — >  c(t)  (see  Exercise  11.3). 

However,  the  work  of  Alan  Turing  (1912-1954)  showed  that  this  intuition  is  not 
always  a  good  guide  for  more  complicated  systems.  In  the  context  of  developmental 
biology  [100],  he  showed  that  the  addition  of  diffusive  effects  to  a  system  of  reactions 
could  enhance  spatial  structure  rather  than  suppressing  it. 

To  illustrate  this  effect,  let  us  consider  a  simple  system  involving  two  chemicals,  P 
and  Q,  that  interact  and  are  also  supplied  into  or  drained  out  of  the  system  according 
to 

P  +  Q  ->  2P  P  — ►  (drain)  Q  — >  2 Q  (source)  — >►  Q. 

These  two  substances  may  diffuse  at  different  rates,  so  we  will  describe  their  spa¬ 
tial  evolution  using  diffusion  terms  with  different  coefficients.  Hence,  consider  the 
nondimensionalized  governing  equations, 
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dp  d2p  dq  d2q 

— -  =  2pq  —  4  +  — — ,  — -  =  —  2pq  +  q  +  3  +  D— —  (11.10a) 

dt  dxz  dt  dxz 

where  D  >  1  is  the  ratio  of  diffusion  coefficients  giving  the  relative  spreading  rate  of 
Q  compared  to  P .  Let  the  domain  be  0  <  x  <  tv,  with  no-flux  boundary  conditions, 

dp  dq 

—  =  0  and  — =0  at  v  =  0  and  v  =  tt  (11.10b) 

dx  dx 

and  initial  conditions  at  t  =  0, 

p(x,  0)  =  po(x),  q(x,  0)  =  qo(x).  (11.10c) 

This  is  a  simple  example  of  a  re  action- diffusion  system.  Research  on  similar  systems 
has  shown  that  they  can  produce  surprisingly  complicated  results,  so  the  general 
approach  to  studying  reaction-diffusion  problems  is  to  start  with  the  simplest  possible 
solutions  and  build  up  an  understanding  from  there. 

Seeking  time-independent  steady  states  reduces  the  PDEs  for  p(x,  t),  q(x,  t)  to 
two  coupled  nonlinear  ODEs  for  p(x),  q(x ),  which  can  still  be  difficult  to  solve. 
Hence,  we  further  constrain  our  search  and  obtain  spatially  uniform  (constant)  solu¬ 
tions,  p *,  q*  from  the  algebraic  relations 

0  =  2p*q*—  4,  0  =  —  2p*q*  +  q*  +  3,  (11.11) 

which  yield  the  equilibrium  solution  p *  =  2,  q*  =  1.  To  explore  the  stability  of  this 
state  to  small  perturbations,  let 


P  =  P*  +  ep(x,  t),  q  =  q*  +  sq(x,  t ) 


(11.12) 


with  £  ^  0.  Treating  these  expression  as  perturbation  expansions,  we  substitute 
into  (11.10a).  At  leading  order  we  recover  the  steady-state  equations  (11.11),  while 
at  O(s)  we  get  the  linearised  version  of  (11.10a), 


dp 

~dt 


=  2p  +  4q  + 


d2p 


dq 

~dt 


3  q  T  D 


(11.13) 


Applying  the  method  of  moments  to  this  system,  we  can  determine  the  evolution 
of  the  mean  values  of  p  and  q  from  the  zeroth  moment  integrals, 


1  fn  „  \  ~ 

Po(t)  =  —  /  p(x,t)dx,  Qo(t)  =  —  /  q(x,t)dx. 
n  Jo  n  JO 


Integrating  (11.13)  over  the  domain  and  applying  the  boundary  conditions  yields  the 
coupled  ODEs, 


222 


dQo 

dt 
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dPo 

dt 


—  2Pq  +  4<2o, 


=  -2P0-3Q0.  (11.14) 


Note  that  (11.14)  corresponds  to  (11.13)  restricted  to  spatially  uniform  solutions. 
A  phase  plane  analysis  of  this  system  shows  that  (Po,  Qo)  =  (0,  0)  is  the  only 
equilibrium  point  and  has  eigenvalues  X  =  (—  1  db  i\f‘ 7)/2,  so  it  is  a  stable  spiral 
point.  Hence  all  solutions  of  (11.14)  will  converge  to  (Po,  Qo)  =  (0,  0)  as  t  — >  oo. 
While  this  correctly  predicts  the  evolution  of  the  means  of  the  perturbations  p  and 
q ,  further  analysis  is  needed  to  describe  their  spatial  structure. 

The  process  of  reducing  a  nonlinear  problem,  like  (11.10a),  to  a  linear  problem, 
(11.13),  by  focusing  on  the  evolution  of  solutions  starting  near  an  equilibrium  state 
is  a  classic  example  of  linear  stability  analysis.  We  applied  this  approach  to  ODEs  in 
Chap.  1;  this  is  also  one  of  the  most  general  approaches  to  exploring  the  behaviour 
of  nonlinear  PDEs  [26] . 

To  complete  the  analysis,  we  make  use  of  the  linearity  of  system  (1 1.13)  to  obtain 
solutions  as  linear  superpositions  of  eigenmodes,  p  =  pk(x,  t ),  and  apply  sep¬ 
aration  of  variables  to  express  the  eigenmodes  as  products  pk(x,  t)  =  fk(x)gk(t ) 
[44].  Requiring  each  eigenmode  to  satisfy  the  boundary  conditions  (11.10b)  yields 


/ pk(x,  0\  =  (ak\ 
\qk(x,t)J  \bk) 


cos  (kx)eXkt , 


(11.15) 


where  ak,bk,  Xk  are  constants  depending  on  the  wavenumber  k  =  0,  1,  2, _ Sub¬ 

stituting  each  independent  mode  into  (11.13)  and  eliminating  common  factors,  we 
arrive  at  the  matrix  eigenvalue  problem 


(11.16) 


For  k  =  0,  this  is  the  eigenvalue  problem  that  would  be  obtained  from  the  stability 
analysis  of  the  ODE  system  (11.14).  More  generally,  for  k  >  0,  we  can  determine 
the  stability  of  the  eigenmodes  from  solving  the  characteristic  polynomial  for  Xk 

(Xk+k2  -2)(Afc+&2D  +  3)+8  =  0.  (11.17) 


The  relation  between  the  spatial  wavenumber  and  the  real  part  of  the  dominant 
eigenvalue,  cr(k)  =  Re(A^)  is  often  called  the  dispersion  relation  and  concisely 
conveys  the  PDE  stability  results  with  respect  to  all  admissible  types  of  perturbations, 
as  represented  by  the  eigenmodes  with  different  values  of  k. 

For  the  case  where  Q  diffuses  much  more  rapidly  than  P  (D  1)  it  can  be  shown 
that  Re(k^)  can  become  positive  and  hence  some  eigenmodes  will  grow  in  amplitude; 
this  is  known  as  a  Turing  instability.  The  growth  rates  for  system  (1 E13)  when  D  = 
100  are  plotted  in  Fig.  1 E 1 ,  where  it  can  be  seen  that  k  =  1  is  selected  as  the  dominant 
growing  mode  (the  only  integer- valued  k  with  a  positive  growth  rate).  Consequently, 
this  mode  will  eventually  dominate  the  dynamics  of  the  system  and  produce  a  pattern 
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Fig.  11.1  The  dispersion 
relation  showing  growth 
rates  of  eigenmodes  (11.15) 
with  k  =  0,  1,  2,  ...  of  the 
linearised  system  (11.13) 
with  D  =  100 


k 


resembling  cos(v)  unless  the  initial  conditions  (11 .10c)  have  no  contribution  from  the 
k  —  1  unstable  mode,  i.e.  ( po ,  qo )  cos(v)  dx  =  (0,  0).  We  note  that  the  influence 
of  the  parameter  D  (giving  the  relative  rates  of  diffusion)  dramatically  changes  the 
stability  of  the  equilibrium  solution  for  different  wavenumbers:  a  stable  spiral  for 
k  =  0,  a  saddle  point  for  k  =  1,  and  stable  node  for  &  =  2,  3,  ... . 

For  different  choices  of  system  parameters,  there  may  be  a  band  of  unstable 
wavenumbers,  k  e  {1,2 , ,kc}  with  a(k)  >  0,  that  will  all  grow  until  nonlinear 
coupling  effects  take  over  to  determine  the  further  dynamics  of  the  solution.  Systems 
having  a(k)  >  0  below  some  critical  wavenumber,  0  <  k  <  kc,  are  often  called 
long-wave  unstable  since  they  involve  spatial  variations  involving  only  the  longest 
wavelength  eigenmodes.  The  Turing  instability  has  been  used  in  many  studies  as  a 
model  for  biological  pattern  formation,  such  as  the  spots  and  stripes  on  animal  coats 
[26,  42,  74]. 

We  note  that  in  attempting  to  use  the  method  of  moments  for  this  problem,  we 
considered  the  zeroth  moments  of  the  perturbations  in  the  linearised  system  (11.13) 
rather  than  in  the  original  nonlinear  system.  There  would  have  been  a  difficulty  in 
integrating  (11 .10a)  directly,  as  it  might  not  be  immediately  clear  how  to  analyse  the 
integral  of  the  nonlinear  term,  pq  dx .  In  the  next  section,  we  will  see  how  this 
type  of  issue  can  be  dealt  with  in  the  context  of  a  different  system. 


11.3  Taylor  Dispersion  and  Enhanced  Diffusion 

We  now  describe  a  classic  model  from  fluid  dynamics  that  illustrates  the  effectiveness 
of  reduction  via  the  moment-type  approach. 

If  a  substance  (called  the  solute )  is  released  into  a  channel  containing  a  flowing 
stream  of  fluid,  the  substance  will  spread  out  as  it  is  carried  downstream  by  the  flow. 
Part  of  this  behaviour  is  due  to  standard  “molecular  diffusion”  effects,  but  spatial 
variations  in  the  flow  velocity  also  contribute  since  the  different  transport  speeds  will 
broaden  the  area  of  distribution  of  the  solute. 
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This  problem  was  studied  by  G.I.  Taylor  (1886-1975),  who  explained  how  the 
full  transport  problem  for  the  solute  could  be  reduced  to  a  one-dimensional  model. 
Taylor’s  model  showed  that  convective  effects  can  increase  the  effect  of  diffusion; 
this  has  come  to  be  known  as  Taylor  dispersion  and  has  been  applied  to  describe  the 
spread  of  pollutants  in  rivers,  drugs  in  blood  flow  as  well  as  in  many  other  settings. 
Taylor’s  original  paper  [98]  was  presented  in  his  unique  and  very  physically  intuitive 
style;  it  is  deceptively  short  and  challenging  to  follow.  Here  we  present  a  derivation 
of  Taylor’s  result  that  is  very  similar  to  the  approach  given  by  Leal  [61,  Sect.  3H] 
(also  see  [20]). 

Consider  a  dimensional  transport  problem  for  the  concentration  of  a  solute 
C  (X,  Y,  T),  given  by  the  advection-diffusion  equation 

dC  o 

— +  V-(CU)  =  DV2C,  (11.18a) 

9T 

where  U(X,  Y,  T)  is  the  fluid  velocity  field  and  D  is  the  constant  of  molecular 
diffusion.  We  choose  the  domain  to  be  a  two-dimensional  long  slender  channel, 
— oo  <  X  <  oo,  — H  <  Y  <  H,  and  let  the  velocity  field  take  the  classic  Poiseuille 
flow  parabolic  profile 


U  =  U0 


(-£) 


-H  <  Y  <  H, 


(11.18b) 


where  Uo  is  the  maximum  speed  of  the  flow  (see  Fig.  11.2  for  a  schematic  of  the 
problem).  We  assume  that  there  is  no  flux  of  the  solute  out  of  the  sides  of  the  domain, 
n  -  VC  =  0,  which  for  the  uniform  width  channel,  simplifies  to 


9C 

9Y 


Y=±H 


(11.18c) 


While  in  most  common  situations,  diffusion  occurs  at  the  same  rate  in  every  direction 
(so-called  isotropic  behaviour),  to  help  distinguish  different  effects,  we  label  the 
diffusion  coefficients  in  the  direction  of  and  transverse  to  the  imposed  flow  as  Dx 


Fig.  11.2  Schematic 
behaviour  of  solute 
dispersion  {green  contours) 
in  a  long  channel  starting 
from  a  small  release  area  due 
to  a  velocity  field  {red) 
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and  Dv  respectively.  In  this  anisotropic  context,  the  transport  equation  (1 1 . 1 8a)  takes 
the  form 

djC  +  V  •  (CV)  =0,  V  =  U  -  DVC, 


where  D  is  a  diffusion  coefficient  matrix,  D  =  |  ^ 

where  Dx  =  Dy  =  D,  this  equation  reduces  to  (11.18a). 
We  nondimensionalize  by  taking 


D  =  (»*  °  ) 
(  0  By)' 


For  the  isotropic  case, 


C  =  C0c,  X  =  Lx,  Y  =  H  y,  T  =  (L/U0)f, 

where  Co  is  a  concentration  scale  that  could  be  set  by  the  initial  conditions  and  L 
is  a  typical  length  scale  along  the  channel.  Consequently,  the  scaled  problem  on 
— oo  <  v  <  oo,  —  l<y  <1  becomes 


dc  2  dc  1  |  1  d2c 

dt+(1~y  -  Pef  9x2  +  p^9^2’ 

with  boundary  conditions  dyc\y—±\  =  0.  The  Peclet  numbers, 


(11.19) 


PeA  = 


U0L 


D 


X 


p  UoH2 
Pev  =  - , 

^  Dy  L 


give  the  relative  importance  of  the  convective  flow  along  the  channel  versus  diffusive 
effects  in  the  v  and  y  directions  respectively.  As  the  flow  carries  the  solute  along  the 
channel,  the  concentration  will  disperse  across  the  full  width  of  the  channel.  Taylor 
showed  that  the  average  concentration  across  the  channel  at  each  v  position  could 
be  estimated  from  a  reduced  model. 

We  note  that  the  concentration  can  be  separated  into  the  average  across  the  channel 
and  deviation  from  the  average 


c(x,  y,  t)  =  c(x,  t)  +  c(x,  y,  t), 


(11.20a) 


where  the  average  c  is  defined  by 


i  f1 

c(x,t)  =  -J  c(x,y,t)dy. 


(11.20b) 


A  consequence  of  this  definition  of  c  is  that  the  deviation  c  has  zero-mean,  i.e. 


c(x ,  v,  t)dy  =  0. 


l 


(11.20c) 
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Similarly,  we  can  separate  the  flow,  u(y)  =  1  —  y2,  into  mean  flow  and  deviation 
from  the  mean. 


u(y)  =  u  +  u(y), 


(11.21a) 


1 


l 


2  2  1  2 
u  =  ^  /  l~y  dy  =  3’  “00  =  3  “  ?  • 


(11.21b) 


The  PDE  (11.19)  can  now  be  expanded  as 


3c  dc  _dc  „  3c  _dc  „  3c 
~\~  —  +  u  —  T-  u  —  +  u  —  +  u 


3 1  3 1  dx 

1  32c 


dx 


dx 


dx 


1  32c  1  32c 

+  — — ^  + 


Pex  dx2  Pex  dx2  Pev  3y 


(11.22) 


This  equation  can  be  integrated  term  by  term  across  the  width  of  the  channel, 


1 

2  /_ 


(11.2)  dy, 


to  give  an  evolution  equation  for  c(x,  t).  The  integrals  of  the  deviations  vanish,  thus 

1  „  „ 
ctdy  =  u  cxdy  =  c  u  dy  =  0. 

-l  J- 1  J- 1 

The  boundary  conditions  on  c  yield  corresponding  no-flux  boundary  conditions  on 
c  and  hence  the  integral  of  the  last  term  in  the  PDE  also  vanishes, 


dc 

dy 


=  0 


1  3 2c  dc 

dy  = 


y=±i 


-1  dy- 


dy 


=  0. 


(11.23) 


The  remaining  terms  determine  the  evolution  of  the  mean  concentration, 


3c  _  3c  1  „  dc  1  32c 

—  T-  u  —  T-  —  /  u  —  dy  = - . 

3 1  dx  2  dx  Pex  dx2 


(11.24) 


In  the  absence  of  the  integral  term,  this  would  be  a  linear  advection-diffusion  equation 
for  c(x,  t).  While  the  integral  involves  only  perturbation  terms,  it  is  not  valid  to 
neglect  it  since  while  the  mean  of  each  deviation  factor  is  zero,  the  average  of  a 
product  is  generally  not  equal  to  the  product  of  the  averages.  Consequently,  additional 
analysis  on  the  properties  of  c  is  needed. 
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Subtracting  the  averaged  Eq.  (11.24)  from  the  full  problem  (11.22)  we  obtain  an 
equation  for  c(x,  y,  t). 


dc  „  dc 
—  H-  u  — 
dt  dx 


H-  u 


dc 

dx 


H-  u 


dc 

dx 


1 

2 


„  dc 
u —  dy 
dx 


l  d2c  l  d2c 
Pex  dx2  Pey  dy2 


(11.25) 


Taylor’s  approach  can  then  be  expressed  in  terms  of  two  assumptions:  (i)  the  deviation 
is  generally  smaller  than  the  mean,  c«c,  and  (ii)  the  channel  can  be  assumed  to  be 
slender  (similarly  to  problems  in  Chap.  8)  and  the  y-derivative  terms  will  dominate 
the  corresponding  v -derivatives,  dy  dx.  The  leading  order  dominant  balance  of 
the  largest  terms  from  the  left  and  right  sides  of  (1 1.25)  then  yields 


1  32c 
P Qy  dy2 


(11.26) 


Since  c  does  not  depend  on  y,  we  can  integrate  this  equation  once  with  respect  to  y 
to  get 


3c  3c 

=  Pe 


dy 


-y 


dx 


(T  -  3?3  +  Al(x’ 


Applying  the  no-flux  boundary  conditions  (11 .23)  at  y  =  ±1  determines  the  function 
of  integration  to  be  A\  (x,  t)  =  0,  so  we  can  proceed  by  integrating  again  to  yield 


3c  /  1  9  1  A 

c  =  Pey—  (  -y2  -  —  +  A2(x,  t) ), 


dx 


0 


12 


■) 


where  A  2  is  a  second  function  of  integration.  By  definition,  the  perturbation  must  have 
zero  mean,  namely,  1  c  dy  =  0.  Applying  this  condition  determines  A 2  =  —7/180 
and  hence  we  conclude  that 


c  =  Pe 


y 


dc 

dx 


(-y2-  — 
\6J  12 


y 


4 


7 


180 


) 


(11.27) 


We  can  now  use  this  to  calculate  the  integral  term  from  (1 1.24), 


1  [l  „dc  8Pey  d2c 

-  u —  dy  = - ~ 

2  J_x  dx  y  945  dx2 


Consequently,  we  can  complete  our  model  equation  for  the  evolution  of  the  average 
density  across  the  channel  (11.24)  as 
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dc  _  dc 
—  ~b  U  — 
dt  dx 


a  = 


8  U^H2\ 
945  D^Dy  J 


(11.28) 


where  a  is  called  the  Taylor  enhanced  diffusion  coefficient. 

In  comparison  with  the  original  two-dimensional  problem  (11. 19),  we  have 
obtained  a  reduced  model.  Equation  (11.28)  is  a  one-dimensional  linear  convection- 
diffusion  equation.  Considering  the  left  side  of  the  model,  we  can  identify  a  convec¬ 
tive  derivative  with  constant  speed  u  and  in  fact,  making  a  change  of  variables  with 
this  wave  speed,  c(x ,  t)  =  F(x  —  ut,t),  yields  the  classical  diffusion  equation  for  F , 


3  F  3  2F 

—  =  a — 
dt  3  7? 


(11.29) 


As  we  have  seen  in  Chap.  5,  this  can  be  solved  using  similarity  solutions,  the  one 
appropriate  for  a  point  source  (consistent  with  initial  conditions  describing  release 
of  solute  at  a  single  location)  is  F(z,  t)  =  Mo  exp(— z2  /  (4at))  /  y/Anat  and  hence 
we  can  predict  the  large-time  behaviour  as 


c(x ,  t)  ~ 


M0 

V^7exp 


—  Ut )2 

4  at 


(11.30) 


where  Mq  is  the  mass  of  solute  (determined  from  the  initial  conditions). 


11.4  Further  Directions 

In  this  chapter,  we  have  sought  to  present  approaches  for  obtaining  information 
about  the  behaviour  of  solutions  of  PDEs  that  do  not  rely  as  heavily  on  perturbation 
methods  as  some  of  the  previous  chapters.  The  derivation  of  the  Taylor  dispersion 
results  can  be  made  more  rigorous  in  terms  of  an  asymptotic  analysis  similar  to  that 
used  in  Chap.  8  (see  Fowler  [37,  pp.  222-223]  and  [58]). 


11.5  Exercises 

11.1  In  Chap.  5  we  derived  the  Gaussian  self-similar  solution  of  the  heat  equation 
pr  =  pxx .  The  most  general  form  of  this  solution  is 

,  Cl  (  (*-C3)2^ 
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which  contains  three  arbitrary  constants.  Evaluate  the  Mo,  M\ ,  M2  moment  integrals 
for  this  solution  and  determine  C\ ,  C2,  C3  in  terms  of  the  results  for  (1 1.2a,  1 1.2b). 
This  can  be  shown  to  select  the  t  — >  00  asymptotic  solution  of  the  initial  value 
problem  for  the  heat  equation  [108]. 

11.2  Consider  the  convection-diffusion-reaction  equation 


dp 

~dt 


+  4  p 


subject  to  the  following  conditions: 

_  2 

(a)  On  the  domain  —  00  <  v  <  00  with  initial  condition  p  (x ,  0)  =  e~x  and  boundary 
conditions  p  — >  0  as  \x\  — >  00.  Define  the  moment  integrals  as 


Mn{t)  = 


■00 


—00 


xn  p(x,  t)  dx, 


Find  Mo(t),  M2(t). 

(b)  For  the  same  equation  on  a  semi-infinite  domain,  0  <  v  <00,  with  initial  condi¬ 
tion  p(x,  0)  =  e~x  and  boundary  conditions 

p(0,  t)  =  1,  p(x  —>  00,  t)  =  0. 

Define  the  moment  integrals  as 


‘OO 


Mn(t)  =  I  x  p(x,t)dx, 
0 


Write  the  differential  equations  and  initial  conditions  for  dM^/dt  and  dM\/dt. 
Write  what  additional  information  you  would  need  in  order  to  solve  for  Mo(t) 
and 

11.3  Consider  the  reaction-diffusion  equation 


dp  d2p 
dt  dx2 


P, 


on  0  <  v  <  it  with  no-flux  boundary  conditions, 


dp 

dx 


=  0, 


x=0 


dp 

dx 


=  0, 


X=J T 


and  the  initial  condition,  p(x,  0)  =  f(x). 
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(a)  Derive  the  problem  for  the  evolution  of  the  mass  Mo(t ),  but  show  that  the  equa¬ 
tions  for  the  higher  moments  are  not  closed. 

(b)  Show  that  separation  of  variables  can  be  used  to  write  the  solution  in  the  form 


oo 

p(x,  t)  =  ape~Xkt  cos (kx). 
k=0 


Determine  the  coefficients  ap  in  terms  of  the  initial  condition.  Show  that 


p(x,  t)  ~ 


mt) 

7X 


as 


oo. 


(c)  If  the  domain  is  changed  to  be  —  oo  <  v  <  oo,  and  the  first  three  moment  integrals 
of  f{x)  converge,  the  exact  solution  can  be  written  as 


P-t  9 

p(x,  t )  =  - —  /  f(x  -  y)e~y  /(4r)  dy. 

v  t  J — oo 


Using  results  on  integrals  of  the  Gaussian  and  basic  properties  of  double  integrals, 
show  that  moments  of  this  formula  for  the  exact  solution  reproduce  the  results 
for  M0,  Mi ,  M2  that  can  be  obtained  from  using  the  PDE  alone. 


11.4  The  Von  Foerster/McKendrick  model  describes  the  evolution  of  populations 
where  age-distribution  of  individuals  is  of  interest  (called  an  age-structured  pop¬ 
ulation  model).  Consider  a  population  described  by  a  density  function  p(a,t)  of 
individuals  with  ages  a  >  0  [74].  Let  the  density  for  0  <  a  <  00  evolve  according 
to  the  transport  equation 


dp  dp 
dt  +  da 


(11.31) 


with  p(a  — >  00)  — >►  0  (describing  that  no-one  lives  forever). 

(a)  Suppose  that  the  new  births  at  any  time  are  proportional  to  the  total  population 
size, 

poo 

p(0,t)  =  /3  /  p(a,t)da. 

Jo 

Show  that  the  model  can  be  reduced  to  a  single  ODE  for  the  total  population, 
M()(t)  =  J^°p(a,t)da. 

(b)  One  possible  refinement  of  the  birth  condition  could  be  to  account  for  the  fact 
that  reproductive  activity  of  individuals  tends  to  decrease  with  increasing  age. 
Consider  the  revised  birth  condition 
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P(0,  t) 


e 


—3a 


p(a ,  t)  da. 


Cushing  [28]  describes  an  approach  that  can  reduce  age- structured  models  to 
ODE  systems  in  terms  of  weighted  moment  integrals.  Show  that  the  model  with 
this  birth  condition  can  be  reduced  to  a  system  of  two  ODEs  for 

pOO  pOO 

Mo(t)  =  /  pda ,  M\{t)  =  /  e~3apda. 

Jo  Jo 

11.5  In  Chap.  5  a  similarity  solution  of  the  porous  medium  equation  was  derived 
using  the  same  approach  that  was  used  for  the  heat  equation.  To  see  how  the  method 
of  moments  applies  to  nonlinear  equations,  consider 

dp  ^  /  3  dp  \ 
dt  dx  \  dx )  ’ 


with  p  — ^  0  for  | x |  — ^  oo  and  initial  condition  p(x,  0)  =  f(x)  >  0  on  —  oo  <  v  < 
00. 

(a)  Show  that  Mo,  M\  are  constant  for  this  problem. 

(b)  Show  that  while  the  moment  model  is  not  closed  for  M2,  M2C)  is  a  strictly 
increasing  function. 

1 1 .6  While  describing  the  spreading  of  the  solute  in  the  transverse  direction  depends 
on  the  presence  of  diffusion,  justify  the  curved  shape  of  the  solute  contours  sketched 
in  Fig.  11.2  using  the  method  of  characteristics  for  the  PDE 


dc 

Yt 


+  (i 


starting  from  the  initial  conditions 


c(x,  y,  t  =  0) 


\  -  (X2  +  y2)  X2  +  y2  <  l 
0  elsewhere 


Hint:  Recall  Exercise  2.8. 

11.7  Examine  how  the  Taylor  dispersion  derivation  changes  for  different  velocity 
fields: 

A 

(a)  for  the  uniform  “plug  flow”  U  =  Uol, 

(b)  for  linear  shear  flow  U  =  Uq(1  —  Y /H)l, 
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11.8  Derive  the  Taylor  diffusion  model  for  laminar  axisymmetric  flow  in  a  circular 
pipe  of  radius  R  =  Ro : 

3C  /  r2\ 3C  a2c  Dr  d  /„ac\ 

?f  +  “  R2  J  3X  ~DxdX2  +  R~3R  (R9r) 

with  no  flux  boundary  conditions  at  the  pipe  wall. 

11.9  The  key  detail  obtained  in  the  derivation  of  the  Taylor  diffusion  model  (11 .28) 
was  the  coefficient  8/945.  The  corresponding  correction  factor  in  Exercise  11.8  or 
for  other  geometries  will  be  different  constants.  For  the  problem  of  two-dimensional 
laminar  flows  (11.19),  some  books  and  articles  may  give  the  “magic  number”  2/105 
instead.  Show  that  this  result  does  not  clash  with  8/945  given  in  the  derivation  above. 
Hint:  Consider  a  different  choice  for  the  scaling  of  the  flow  velocity. 


Part  III 
Case  Studies 


Chapter  12 

Modelling  in  Applied  Fluid  Dynamics 


This  chapter  differs  substantially  from  the  previous  chapters  in  that  we  do  not  aim  to 
explain  any  new  methodologies  here.  Instead,  we  will  make  use  the  approaches  devel¬ 
oped  earlier  to  illustrate  two  case  studies  of  how  mathematical  models  of  physical 
problems  can  be  formulated  and  analysed. 

Our  two  examples  applications  come  from  problems  in  fluid  dynamics,  on  air 
bearing  sliders  and  wedge-rivulet  flow.  We  begin  by  deriving  a  common  core-model, 
called  lubrication  theory ,  that  forms  the  basis  for  both  applications.  While  the  deriva¬ 
tion  of  lubrication  theory  draws  extensively  on  the  methods  for  long- wave  asymptot¬ 
ics  that  were  introduced  in  Chap.  8,  we  will  not  attempt  to  individually  identify  the 
many  other  connections  between  the  material  in  this  chapter  and  the  methods  given 
in  the  other  earlier  chapters.  Having  this  background,  readers  should  be  comfortable 
with  the  presentations  here,  which  are  at  levels  that  are  typical  of  basic  modelling 
in  current  applied  research  and  case  studies  in  more  advanced  books  on  modelling 
[27,  37,  96].  For  further  examples  of  case  studies  in  mathematical  modelling  see 
[51,53,  69]. 


12.1  Lubrication  Theory 

The  Navier-Stokes  equations  comprise  the  fundamental  continuum  mechanics  math¬ 
ematical  model  for  the  dynamics  of  fluids  having  viscosity  (namely,  realistic  dissipa¬ 
tion  or  internal  friction).  These  partial  differential  equations  are  conservation  laws  for 
mass  and  momentum  of  fluids  and  universally  applicable.  However,  for  a  number  of 
analytical  and  computational  reasons,  they  can  be  very  challenging  to  solve  for  most 
problems.  Consequently,  to  make  progress,  most  applied  studies  make  use  of  spe¬ 
cific  forms  of  their  problems  to  further  reduce  the  Navier-Stokes  equations  to  more 
tractable  models.  For  problems  that  involve  slender  layers  of  viscous  fluids,  lubri¬ 
cation  models  can  be  derived  from  the  Navier-Stokes  equations  using  asymptotics 
in  terms  of  the  dominance  of  viscous  effects  and  the  slenderness  of  the  layer.  Dat- 
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ing  back  to  the  work  of  Osborne  Reynolds  (1842-1912),  such  models  have  become 
essential  for  advancing  the  analysis  of  problems  from  a  diverse  range  of  applications. 

We  begin  with  the  Navier-Stokes  equations  for  a  compressible  viscous  Newtonian 
fluid.  Written  in  vector  form,  the  continuity  equation  and  momentum  balance  are 

dp  „  ( 9V 

+  v  •  (pV)  =  o,  P(- 

where  ju  is  the  viscosity  coefficient,  p(X,  Y,  Z,  T)  is  the  fluid  density,  P(X,  Y,  Z,  T) 
is  the  pressure,  and  the  fluid  velocity  field  is  V(X,  Y,  Z,  T)  =  (U,  V,  W).  Expanded 
out  componentwise  (12.1)  yields  a  system  of  four  coupled  nonlinear  PDEs, 

PT  +  (pU)x  +  (pV)y  +  (pW)z  =  0,  (12.2a) 

/5(Ut  +  UUx  +  VUy  +  WUz)  =  — Px  +  M(Uxx  +  Uyy  +  U  zz),  (12.2b) 

p(V  t  +  UVx  +  VVy  +  WVz)  =  —  Py  +  m(Vxx  +  Vyy  +  Vzz),  (12.2c) 

p(Wj  +  UWx  +  VWy  +  WWz)  =  —  Pz  +  +  Wyy  +  VVzz)-  (12. 2d) 

For  convenience,  we  simplify  our  presentation  to  the  two-dimensional  problem1 
where  W  =  0  and  all  properties  are  independent  of  Z  (i.e.  dz  =  0)  and  hence 
consider  the  system, 

Pt  +  (pU)x  +  (pV)y  =  0,  (12.3a) 

P(  UT  +  UUx  +  VUy)  =  -Px  +  /x(UXy  +  Uyy),  (12.3b) 

y5(V  T  +  UVx  +  VVy)  =  -Py  +  /x(Vyy  +  Vyy).  (12. 3c) 


+  V  •  VV  I  =  -  VP  +  /zV2V, 


(12.1) 


We  consider  the  dynamics  of  a  thin  layer  of  viscous  fluid  spreading  on  a  flat  solid 
surface,  as  shown  in  Fig.  12.1,  with  the  average  height  given  by  H  and  the  lateral 
lengthscale  given  by  L.  We  formally  nondimensionalize  using  the  scalings 

X  =  Lx,  Y  =  Hy,  T  =  ft,  (12.4a) 

\J  =  lJu,  V  =  Vv,  P  =  P  p,  p  =  pp.  (12.4b) 


We  go  on  to  use  the  aspect  ratio  s  =  H/L  as  an  asymptotic  parameter,  s  — >  0. 
Applying  the  scalings  to  (12.3a)  yields  the  nondimensional  form 


dp 

~dt 


+ 


t-(Pm)  + 
dx 


—  (pv)  =  0. 

dy 


Conservation  of  mass  is  a  fundamental  property  and  it  should  hold  exactly,  inde¬ 
pendent  of  the  geometry  or  any  considerations  of  flow  speed.  Hence,  independent 
of  the  final  choice  of  characteristic  scales,  we  should  retain  the  general  form  given 


i 


The  derivation  for  the  full  three-dimensional  system  follows  analogously. 
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Fig.  12.1  A  thin  him  of  a 
viscous  fluid  coating  a  hat 
solid  surface 


A 


Y 


L 

- 


Y  =  H(X,  T) 


by  (12.3a).  This  motivates  selecting  scales  to  make  the  continuity  equation  scale- 
invariant;  namely  we  pick  a  convective  timescale  T  =  L/U  and  a  vertical  velocity 
scale  based  on  the  horizontal  velocity  and  the  aspect  ratio,  V  =  eU,  to  yield  the  final 
dimensionless  equation, 


dp  d  d 

/  +  —  (pu)  +  —  (pv)  =  0.  (12.5) 

dt  dx  dy 

Turning  to  the  momentum  equations,  using  the  scalings,  the  non-dimensional 
forms  of  (12.3b,  12.3c)  are 


(■ 


2  pUh\ 


) 


(Ut  +  uux  +  VUy)  =  —  (  8 


0 


2  ydUL\ 


[l 


) 


Of  +  UVX  +  VVy)  =  - 


V  im) 

/PLN 

V/iU/ 


Px  T  £  T  ^yy?  (12.6a) 


Py  T  £  Vxx  T  ^yy- 


(12.6b) 


Apart  from  the  aspect  ratio,  these  equations  contain  two  dimensionless  parameters. 
The  reduced  Reynolds  number  gives  the  ratio  of  effects  of  inertial  acceleration  relative 
to  viscous  forces. 


~  9  pUL 

Re  =  s 2 - <  1, 

/x 


(12.7) 


To  assert  the  dominance  of  viscous  effects,  we  assume  that  this  parameter  is  negligibly 
small. 

Consequently,  (12.6a,  12.6b)  reduces  to 


0  =  - 

0  =  - 


(•f) 

< 


li  U 
uV) 


Px  T  £  UXX  T  Uyy, 


Py  T  £  Vxx  T  Vyy  ■ 


(12.8a) 

(12.8b) 
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We  note  that  this  reduction  of  the  Navier  Stokes  equations  (having  removed  the 
nonlinear  inertial  acceleration  terms)  is  called  the  Stokes  equations ,  and  for  the 
incompressible  (constant  density)  case,  their  dimensional  form  is 

0  =  -VP  +  MV2V,  V-V  =  0.  (12.9) 


The  choice  of  the  pressure  scaling  in  (12.8a,  12.8b)  remains.  One  option  is  to 
apply  dominant  balance  to  the  y -momentum  equation,  yielding  the  derived  scale 
P  =  /xU/L  =  0(1),  which  is  called  a  viscous  pressure  scale  since  it  balances 
viscous  and  pres  sure- gradient  effects.  However  it  can  be  shown  from  the  resulting 
leading  order  v-equation,  uoyy  =  0,  that  for  many  problems,  this  choice  can  be  too 
restrictive  and  may  to  lead  to  solutions  that  will  not  capture  the  full  structure  of 
the  flow.  The  other  option  is  dominant  balance  in  the  v -equation,  yielding  another 


version  of  the  viscous  pressure  scale,  P  =  /x U/(£2L)  =  0(s  2).  This  leads  to  the 
leading  order  system, 


dpo  _  82«0  dpo 

dx  3 y2  ’  3 y 


(12.10) 


The  second  equation  determines  that  po  is  independent  of  y,  namely  po  =  po(x,t). 
Consequently  the  first  equation  can  be  integrated  with  respect  to  y  to  yield  a  parabolic 
form,  sometimes  called  the  Nusselt  velocity  profile. 


U0=l~y2  +  Ciy  +  C2,  (12.11) 

where  C\,  C2  are  constants  of  integration  with  respect  to  y.  These  constants  are 
determined  by  boundary  conditions  on  the  lateral  velocity  u  at  the  top  and  bottom  of 
the  fluid  layer  for  the  particular  problem  at  hand. 

For  viscous  fluids  in  contact  with  solids,  the  no- slip  boundary  condition  states 
that  the  velocity  of  the  fluid  tangential  to  the  solid  must  match  that  of  the  solid.  In  this 
problem  the  solid  surface  is  y  =  0,  and  the  no-slip  condition  specifies  that  uo(0)  =  0, 
which  determines  C2  =  0  in  (12.11).  At  the  surface  of  the  layer,  y  =  h(x,  t),  if  the 
fluid  were  not  subjected  to  any  forces  (surface  stresses)  then  the  lateral  speed  at  the 
surface  should  be  the  same  as  the  bulk  of  the  fluid;  the  simplest  form  of  the  stress -free 
boundary  condition  is  then 


=  0  =>>  C\  = —pox(x,t)h(x,t).  (12.12) 

Consequently,  we  have  determined  that  for  films  spreading  on  flat  surfaces, 

u0(x,  y,  t)  =  jPOxiy2  -  2hy). 


duo 

dy 


(12.13) 
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To  complete  the  derivation  of  the  lubrication  model,  we  return  to  the  equation  for 
the  conservation  of  mass,  (12.5),  and  integrate  it  over  the  thickness  of  the  layer, 


*/z  r 


dp  d  d 

—  +  — ( pu )  +  —  (pv) 
dt  dx  dy 


dy  =  0. 


The  final  term  can  be  integrated  directly  to  give  a  relation  between  the  fluxes  through 
the  bottom  and  top  of  the  layer, 


y=h 


pv 


y= o 


(12.14) 


At  y  =  0,  the  fluid  rests  on  an  impermeable  solid  surface,  so  there  should  be  no  flux 
through  it,  hence  the  no-flux  boundary  condition  gives  v(y  =  0)  =  0.  At  the  top  of 
the  layer,  the  fluid  has  a  free  boundary,  and  the  vertical  component  of  the  velocity 
there  should  follow  the  motion  of  the  surface  y  =  h(x,  t).  Consider  a  particle  on  the 
surface  that  is  carried  by  the  flow,  having  position  ( X  ( t ),  Y ( t )),  and  remains  on  the 
surface  for  all  times,  so  Y ( t )  =  h(X(t),  t).  Then,  using  the  chain  rule  and  recalling 
the  Lagrangian  description  of  the  velocity,  the  rate  of  change  of  the  vertical  position 
of  any  point  on  a  free  boundary  is  given  by 


dY  dh  dh  dX 
dt  dt  dx  dt 


v 


y=h 


dh  dh 

-  H-  ll - 

dt  dx 


y=h 


(12.15) 


which  is  called  the  kinematic  boundary  condition.  Applying  these  two  boundary 
conditions  to  (12.14)  yields 


/  dh 

dh\ 

rh 

dp 

d 

Car 

+  pu 

+  / 

+  „  (pu) 

dx  ) 

o 

-s: 

II 

dt 

dx 

dy  =  0. 


Applying  Leibniz’s  rule  (in  reverse,  with  respect  to  v  and  t  separately)  then  gives  a 
transport  equation  describing  conservation  of  mass 


d 

dt 


(12.16) 


If  we  take  p  =  p(x,  t)  then  this  reduces  to 
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Finally,  using  the  velocity  profile  (12.13)  to  evaluate  the  integral  yields 


d(ph) 

dt 


(12.18) 


This  is  a  form  of  the  Reynolds  equation  that  forms  the  basis  of  lubrication  models 
in  a  wide  array  of  applications  involving  free-surface  thin  film  flows  [25,  75,  79,  82] 
and  fluid- structure  interactions  involving  lubricating  fluid  layers  [12,  46]. 


12.2  Dynamics  of  an  Air  Bearing  Slider 

One  of  the  historical  motivations  for  the  development  of  lubrication  theory  has  been 
the  use  of  fluids  as  cushioning  layers  between  moving  surfaces — an  everyday  appli¬ 
cation  is  oil  lubricating  parts  of  an  engine  to  allow  relative  motion  without  metal 
surfaces  scratching  against  each  other. 

Lubrication  theory  is  also  used  in  modern  technology  [12] — in  computer  hard 
disk  drives,  air  acts  as  a  lubricating  gas  layer  separating  the  electronic  read/write 
head  from  the  rapidly  rotating  rigid  data  disks.  To  maximize  the  data  density  on  the 
disk,  the  write  head  must  be  kept  very  close  to  the  surface,  and  to  maximize  data 
access  speed,  the  disk  speed  should  be  high.  However  both  of  these  effects  might 
suggest  the  system  could  be  sensitive  to  any  variations  in  disk  speed,  external  forces 
and  motion  of  the  system,  or  variations  of  pressure  that  might  lead  the  head  to  collide 
with  the  disk.  Hence  it  is  very  important  to  have  a  model  of  the  system  that  can  guide 
the  design  process  to  configurations  that  are  stable  to  perturbations. 

The  one-dimensional2  version  of  the  geometry  we  are  considering  is  shown  in 
Fig.  12.2.  The  read/write  head,  sometimes  also  called  a  slider  bearing  [79],  has 
a  particular  length  L  and  has  its  lateral  position  fixed,  but  it  is  allowed  to  move 
vertically.  The  disk  is  a  flat  surface  (Y  =  0)  moving  horizontally  at  local  speed  U. 
The  motion  of  the  disk  will  generate  a  flow  of  air  under  the  slider  due  to  the  no- slip 
boundary  condition  on  the  gas.  This  flow  will  generate  a  lift  force  on  the  slider  that 
will  balance  against  the  weight  of  the  slider  and  any  other  applied  downward  forces. 
The  desired  average  gap  height  H  sets  a  vertical  lengthscale.  The  gap  height  between 
the  disk  and  slider  surfaces  is  given  by 

H(X,  T)  =  A(T)  +  S(X),  (12.19) 

where  A  is  the  vertical  position  of  the  leading  edge  of  the  slider  and  S(X)  gives  the 
shape  of  the  slider’s  lower  surface,  with  S(0)  =  0.  Outside  of  the  region  under  the 
slider,  the  air  pressure  will  be  assumed  to  be  the  usual  atmospheric  pressure,  Patm- 


2 Referring  to  the  number  of  lateral  dimensions. 
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Fig.  12.2  The  geometry  of  the  problem  for  an  air  bearing  slider.  Lubrication  theory  can  be  applied 
to  model  the  air  flow  in  the  thin  gap  between  the  moving  disk  and  slider  surfaces 


Under  the  assumption  that  the  reduced  Reynolds  number  is  small,  (12.7),  and  that 
the  aspect  ratio  of  the  gap  is  small,  s  =  H/L  <<C  1,  lubrication  theory  can  be  applied. 
Written  in  dimensional  form,  the  parabolic  velocity  profile  (12.11)  in  the  gap  is 


J_^y2 
2 n  ax 


+  Cl  Y  +  C2. 


(12.20) 


The  scalings  and  boundary  conditions  appropriate  to  this  problem  must  be  applied 
to  this  general  form.  At  the  disk  and  slider  surfaces,  no- slip  boundary  conditions 
impose  the  lateral  speeds  on  the  gas  flow, 


U(Y  =  0)  =  U,  U(Y  =  H)  =  0. 


These  conditions  select  the  constants  of  integration  in  (12.20)  to  yield 


U  = 


1  9P 


iii  ax 


(Y2  -  HY)  +  U  1  -  - 


(12.21) 


where  we  have  separated  the  contributions  due  to  the  pressure-gradient  driven 
Poiseuille  flow  term  and  the  linear  shear-flow  Couette  flow  term.  The  next  step 
is  to  use  this  velocity  profile  in  the  equation  of  conservation  of  mass  to  derive  the 
appropriate  form  of  the  Reynolds  equation  for  this  problem. 

While  Eq.  (12.16)  was  derived  under  a  different  type  of  boundary  condition  on  the 
upper  surface  of  the  fluid  layer,  it  can  be  shown  that  the  same  equation  is  obtained 
when  the  kinematic  boundary  condition  is  replaced  by  a  no-flux  condition  appropriate 
to  the  interface  with  the  slider  surface  at  Y  =  H  [46].  This  could  be  expected  from 
physical  considerations  of  (12.17)  as  a  transport  equation;  since  the  mass  of  air  in 
any  column,  pH,  is  conserved,  its  rate  of  change  can  only  be  due  to  transport  to  other 
positions  in  the  gap  by  the  flux.  Consequently,  we  obtain  the  dimensional  equation 


9 (pH)  U  9(pH)  =  1  a  /  3  9P\ 

9T  +2  ax  12/1  ax  v  ax/' 


(12.22) 
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To  nondimensionalize  this  equation,  we  need  to  select  characteristic  scales.  For  the 
velocity  scale,  it  is  natural  to  use  the  imposed  disk  speed  U.  For  the  lateral  length 
scale,  the  length  of  the  slider,  L,  is  a  convenient  choice;  the  derived  convective 
timescale  is  then  T  =  L/U.  In  the  vertical  direction,  the  reference  height  H  can 
be  used.  Equation  (12.22)  is  linear  with  respect  to  p,  so  whatever  the  choice  of  the 
characteristic  density,  it  will  scale  out.  The  boundary  conditions  on  the  pressure  at  the 
edges  of  the  slider  give  an  imposed  pressure  scale,  P  =  Patm-  Using  these  scalings 
in  (12.4)  and  writing  H  =  H h  we  obtain  the  dimensionless  equation  [30,  46] 

d(ph)  i9(pA)  =  ]_l_  /  3dp\ 

dt  2  dx  A  dx  \  dx )  ’ 

containing  one  dimensionless  parameter,  called  the  bearing  number , 


12/xLU 

PatmH2  ’ 


(12.23) 


which  gives  the  ratio  of  the  relative  effects  of  the  convective  (Couette)  to  diffusive 
(Poiseuille)  terms,  analogous  to  a  Peclet  number.  For  applications  of  interest,  small 
gap  heights,  small  aspect  ratios  and  high  speeds  all  point  to  large  bearing  numbers; 
hence  we  define  8  =  1/ A  as  a  small  parameter. 

To  complete  the  formulation  of  the  model,  we  need  to  specify  a  relation  between 
the  pressure  and  the  gas  density.  The  simplest  choice  for  the  equation  of  state  is  the 
ideal  gas  law,  which  states  that  the  pressure  is  proportional  to  the  product  of  the 
density  with  the  temperature.  Assuming  the  temperature  to  be  held  fixed,  this  gives 
p  =  kp  and  reduces  the  Reynolds  equation  to  a  PDE  for  the  evolution  of  the  product 
p(x,  t)h(x,  t). 


d(ph)  i  d(ph) 

dt  2  dx 


(12.24a) 


on  the  domain  0  <  x  <  1  and  is  subject  to  the  boundary  conditions  on  the  pressure, 


p(0,t)  =  l,  =  (12.24b) 


For  5^0,  this  is  a  singularly  perturbed  boundary  value  problem  and  can  be  shown 
to  generate  a  boundary  layer  in  the  pressure  at  v*  =  1 ,  the  trailing  edge  of  the  slider. 
On  the  rest  of  the  domain,  0  <  x  <  1,  we  can  approximate  the  solution  by  the 
solution  of  the  leading  order  outer  problem, 


d(pph)  i  d(pph) 

dt  2  dx 


(12.25) 


This  is  a  linear  advection  equation  and  yields  the  solution  as  a  traveling  wave  with 
speed  one  half,  F(x  —  ^t),  which  can  also  be  expressed  as 
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Po(x,  t)h(x ,  t)  =  fit  -  2x), 


(12.26) 


for  some  choice  of  function  f(s).  Applying  the  boundary  condition  on  the  pressure 
at  x  =  0  and  evaluating  the  gap  height  there,  from  (12.19),  h(0,t )  =  a(t ),  and 
we  determine  fit)  =  a  it).  Consequently  the  solution  of  this  signaling-type  wave 
problem  for  the  pressure  can  be  expressed  as 


poix,  t) 


af  —  2x) 
af)  +  ^(v)  ’ 


(12.27) 


from  p o  =  f/h.  We  can  now  use  this  representation  of  the  pressure  to  return  to  the 
primary  question  of  the  stability  and  dynamics  of  the  slider. 

The  motion  of  the  slider  is  characterized  by  the  height  of  the  leading  edge, 
y  =  af),  hence  we  seek  an  evolution  equation  for  af).  This  will  be  provided 
by  a  force  balance  in  the  vertical  direction.  As  described  above,  the  air  flow  gener¬ 
ates  a  lift  force  on  the  slider,  this  is  given  by  the  integral  of  the  excess  pressure  over 
the  domain  of  the  slider, 


FlO)  =  [  ipix,t)  -  \)dx. 

J  o 


This  balances  against  the  downward  applied  load  imposed  on  the  slider  by  structural 
constraints,  Fs  to  yield  the  force  balance  equation 


m 


af  —  2x) 
af)  +  s(v) 


dx  -  iFs  +  1), 


(12.28) 


where  m  is  the  scaled  mass  of  the  slider.  We  note  that  by  neglecting  the  trailing  edge 
boundary  layer  in  the  pressure,  we  only  make  a  small  error,  (9 (A-1)  in  calculating 
the  lift  integral. 

We  can  obtain  a  time-independent  steady-state,  a  =  a,  by  solving  an  algebraic 
equation, 


a 


dx 

a  +  six) 


=  FS  +  1. 


Then,  we  can  examine  the  linear  stability  of  this  steady  solution  to  small  deviations 
by  assuming  a  perturbed  solution,  af)  =  a  +  aeXt ,  for  o  — >►  0.  Substituting  this 
form  into  (12.28)  yields  an  equation  for  the  exponential  growth  rate  X  at  (9(cx), 


mX 


2 


g—2Xx 

- - dx  — 

a  +  six) 


a 

id  +  s(v))2 


dx. 


(12.29) 


If  the  governing  equation  for  af)  were  an  autonomous  ODE,  then  A  would  be  given 
by  the  roots  of  a  characteristic  polynomial.  However  due  to  the  unusual  shifted  depen- 


244 


12  Modelling  in  Applied  Fluid  Dynamics 


dence  on  the  solution  in  the  integral,  (12.28)  can  be  related  to  a  delay-differential 
equation  and  yields  a  more  challenging  transcendental  equation  for  the  linear  stability 
analysis.  Careful  consideration  of  this  problem  was  explored  in  [109]. 


12.3  Rivulets  in  a  Wedge  Geometry 


In  the  previous  section,  we  considered  a  scenario  where  the  substrate  was  uniformly 
flat.  Here,  we  investigate  a  lubrication-type  problem  for  a  rivulet  (slender  thread) 
of  viscous  fluid  constrained  within  a  wedge  geometry3  (see  Fig.  12.3).  In  particular, 
we  will  study  the  so-called  large-time  dynamics  of  the  problem  using  asymptotic 
approximations  and  self-similar  solutions.  As  well  as  being  a  well-observed  phys¬ 
ical  phenomena,  interesting  in  its  own  right,  the  results  that  we  obtain  also  have 
application  to  the  study  of  foam  drainage,  where  the  flow  takes  place  within  the  “tri¬ 
angular”  shaped  region  of  the  Plateau  borders4  (see  [103]  for  a  general  discussion 
of  the  dynamics  of  foams). 

We  shall  take  X  to  represent  the  distance  along  the  wedge,  with  Y  as  the  vertical 
upward  distance  from  the  base  of  the  wedge,  and  Z  as  the  transverse  distance  from 
the  centreline  of  the  wedge  (along  with  the  respective  velocities  U,  V  and  W). 

We  begin  with  the  dimensional  Navier- Stokes  equations  in  three-dimensions 
(12.2);  the  dimensional  density  p  is  taken  to  be  constant.  We  apply  the  nondimen- 
sionalisation 

X  =  Lx,  Y  =  Hy,  Z  =  Hz,  T  =  ft,  (12.30a) 

U  =  Vu/e,  V  =  Vv,  W  =  Vw,  P  =  P  p,  (12.30b) 

where  H  is  a  typical  fluid  depth  and  V  =  s2y //a  is  a  representative  velocity  scale 
(here,  y  is  the  coefficient  of  surface  tension  for  the  free  surface  of  the  liquid).  The 
expressions  (12.30)  imply  that  we  consider  the  magnitude  of  the  flows  in  the  vertical 
and  transverse  directions  to  be  of  the  same  order,  and  (as  in  the  previous  section)  the 
aspect  ratio  e  =  H/L  «  1.  After  substitution  and  collecting  terms  in  orders  of  e,  the 
leading  order  equations  are  found  to  be 


dp 

dx 


d2u  d2u 
dy 2  dz 2  ’ 


(12.31) 


being  a  reduced  version  of  the  Stokes  equations  (12.9).  The  last  two  equations  of 
(12.31)  show  that  to  leading  order  the  pressure  field  is  independent  of  both  y  and  z, 
and  hence  p  =  P(x,t).  Furthermore,  geometric  considerations  can  be  used  to  show 


3 This  analysis  is  based  on  the  paper  [16]  to  which  we  refer  the  interested  reader  for  more  details. 

4The  sides  of  a  Plateau  border  are  actually  circular  arcs  and  there  is  no  free-surface,  but  the  model 
equations  that  we  derive  here  are  still  applicable.  This  is  an  example  of  how  modelling  and  inves¬ 
tigating  one  problem  can  also  provide  useful  information  for  other  related  problems. 
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Fig.  12.3  Schematic  of  the  problem  geometry 


that  P  oc  A-1/2,  where  A(x,t)  is  the  cross-sectional  area  of  the  rivulet  at  location 
v  and  time  t.  The  first  equation  of  (12.31)  can  be  solved  subject  to  no- slip  boundary 
conditions  on  the  substrate  and  pressure  jump  conditions  across  the  free  surface  of 
the  liquid  to  yield  u  a  A2dP/dx. 

The  conservation  of  mass  result  for  the  cross-sectional  area  A(x,  t )  takes  the  form 


dA  d 
dt  +  dz 


(12.32) 


where  D  is  the  area  of  the  (y,  z)  plane  occupied  by  liquid  at  fixed  v  and  t .  Substituting 
in  the  relevant  expressions  for  u  and  P  from  above  consequently  results  in  a  PDE 
for  the  evolution  of  A(x,  t)  which  takes  the  (porous-medium-equation-type)  form 


dA 

~dt 


0  <  v  <  oo 


(12.33) 


and  is  the  basic  equation  that  we  will  further  analyse.  For  initial  data,  we  impose 


A(x,  0)  =  « 


M 

0 


0  <  x  <  l, 
x  >  1, 


(12.34) 


corresponding  to  a  constant  cross-section  of  fluid  over  0  <  v  <  1  with  a  dry  wedge 
for  v  >  1;  the  initial  volume  of  liquid  is  consequently  also  given  by  M. 

Once  the  liquid  is  in  motion  ( t  >  0),  there  will  be  a  moving  free-boundary 
x  =  s(t)  (with  A  =  0  for  v  >  s(t )),  on  which  we  impose  the  physically  sensible 
conditions  A  =  0  (zero  height)  and  A1/2  Ax  =  0  (zero  flux  across  the  interface).  At 
x  =  0,  we  also  impose  zero  flux  (corresponding  to  a  solid  wall),  so  that  the  total 
mass  of  liquid  in  the  wedge  is  conserved  for  all  time,  i.e. 
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j  rs(t)  rs(t)  rs(t ) 

—  /  A(x,t)dx  =  0  =>-  /  A(x,t)dx=  /  A(x,0)dx=M. 

dt  Jo  Jo  Jo 

(12.35) 

The  nonlinear  diffusion  equation  (12.33)  coupled  with  the  conservation  law 

(12.35)  admits  a  similarity  solution 


A  =  t  2/5  f(r]),  j]  = 


where  f(rj)  satisfies 


with  solution 


(12.36) 


(12.37) 


the  subscripted  plus  sign  here  denotes  that  we  are  only  interested  in  the  non-negative 
part  of  the  solution,  with  the  moving  interface  v  =  s(t)  corresponding  to  rj  =  tjq  in 
terms  of  the  similarity  variables.  Of  course,  (12.36)  is  singular  in  the  limit  t  — >  0+ 
and  so  cannot  satisfy  the  initial  condition  (12.35).  However,  it  is  well-known  that 
similarity  solutions  often  act  as  large-time  attractors  for  the  dynamics  of  nonlinear 
diffusion  equations  and  so  we  can  expect  A(x ,  t)  to  approach  (12.36)  and  (12.37)  at 
large  times. 


12.3.1  Imbibition  in  a  Vertical  Wedge 


Suppose  now  that  we  orient  the  wedge  so  that  it  is  vertical,  with  the  base  located 
at  v  =  0.  In  such  a  geometry,  we  can  no  longer  neglect  gravity  and  it  will  be  the 
competing  effects  of  gravity  (acting  downwards)  and  capillarity  (acting  upwards) 
that  drives  the  motion  of  the  fluid. 

Taking  gravity  to  act  equally  everywhere  on  the  fluid  leads  to  a  convective  flow  in 
the  downward  direction  that  requires  the  evolution  Eq.  (12.33)  to  be  adjusted  to  read 


3  A 
~dt 


-2  A 


3  A 
dx 


0  <  x  <  oo; 


(12.38) 


we  note  that  the  left-hand- side  terms  of  (12.38)  correspond  to  a  quasilinear  convection 
equation  that  can  be  tackled  by  the  method  of  characteristics  (starting  from  the  initial 
data  (12.34))  and  shown  to  have  smooth  solutions  for  all  t  >  0. 
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Physically,  we  expect  the  amount  of  liquid  that  can  be  pulled  up  the  wedge  by 
capillarity  to  be  small  compared  to  the  bulk  mass.  We  therefore  assume  that  the 
solution  in  the  latter  region  will  effectively  be  a  steady-state  solution  of  (12.38)  and 
consequently  find 

4 

A  =  - 7T  for  v  =  0(1).  (12.39) 

(x  +  xo  Y 


The  value  of  xo  is  set  (to  leading  order)  by  the  mass  constraint  (12.35),  by  which  we 
calculate  xo  =  4/M.  The  steady-state  inner  solution  (12.39)  will  match  to  the  outer 
solution  governed  by  capillary  action. 

In  the  capillary  flow  region,  we  cannot  neglect  the  time-dependence  in  (12.38). 
We  do  not  attempt  to  derive  an  exact  solution  of  (12.38),  but  note  that  it  admits  a 
similarity  solution  of  the  form 

A  —  t~2/3g(/i),  n=-^,  (12.40) 

with  giii)  satisfying 

~\8  ~  \,l8lL  =  (gl/2g^  -  g2)^  • 

Close  to  x  =  sit),  with  sit)  =  /zpf1/3  in  terms  of  the  similarity  variables, 


8  ~ 


/z)  as  ii  —>  /z0 


so  that  the  solution  has  zero  contact  angle  at  /z  =  /zo.  As  /z  — >  0+  we  must  match 
with  (12.39)  as  x  — >  oo  and  this  requires  that 


8  ~ 


as 


A  schematic  of  the  overall  (large-time)  solution  structure  is  shown  in  Fig.  12.4. 


Fig.  12.4  Schematic  of  the 
large-time  solution  for 
gravity  opposing  capillarity 
based  on  the  solution  of 


Eq.  (12.38) 


A 


A 


0(1) 


-2/3 


) 


0(1) 


s  (t) 
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12.3.2  Draining  in  a  Vertical  Wedge 


In  this  scenario,  v  =  0  is  taken  to  be  at  the  top  of  the  wedge,  with  the  liquid  moving 
downwards  (again  starting  from  (12.34))  under  the  now  combined  effects  of  gravity 
and  capillarity.  The  evolution  equation  takes  the  same  form  as  (12.38),  except  for  a 
different  sign  on  the  convective  term 


9 A  dA  d  (  i  n  dA\ 

—  +2 A —  =  — |A1/2 — ),  0  <  x  <  oo;  (12.41) 

dt  dx  dx  \  d x) 

notably,  in  contrast  to  the  case  of  imbibition,  the  quasilinear  equation  allows  shocks 
to  form  and  we  will  observe  this  phenomena  in  the  large  time  solution  to  (12.41). 

Away  from  v  =  0,  the  dynamics  of  the  ( outer  region )  solution  are  primarily 
controlled  by  the  effects  of  gravity  and  the  reduced  PDE  is  consequently  given  by 


dA 

~dt 

with  a  solution  of  self- similar  form 


-2A 


dA 

dx 


A  =  rl/2h(v), 


Here,  h(v)  satisfies 


so  that 


h 


\v  0  <  V  <  Vo, 

0  V  >  Vq. 


(12.42) 


(12.43) 


(12.44) 


Standard  matching  arguments  imply  that  A  ~  z/2t  as  v  ->  0+,  while  conservation 
of  mass  (12.35)  determines  the  shock  location  vo  =  2\[M.  Across  the  shock,  the 
effects  of  capillarity  are  important  and  smooth  the  solution  over  a  narrow  interior 
layer  with  scalings 


v  =  s(t)  +  £1/4§, 


s^vot1/2,  A~f  1//20(§). 


As  a  result,  we  find 


(12.45) 


which  corresponds  to  a  travelling  wave  solution  of  (12.41). 
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Fig.  12.5  Schematic  of  the  large-time  solution  for  gravity  assisting  capillarity  based  on  the  solution 
of  Eq.  (12.41) 


The  ( inner  solution )  behaviour  close  to  x  =  0  is  again  given  by  (12.40),  with 

g  ~  ii / 2  as  fi  — >  oo  in  order  to  match  with  the  outer  solution.  Finally,  as  u  — >  0+, 

3 /2 

we  have  g  go  +  go  v  as  a  consequence  of  the  zero-flux  boundary  condition 
imposed  at  v  =  0;  numerical  calculations  yield  go  ~  0.5885.  The  entire  large-time 
solution  structure  is  illustrated  schematically  in  Fig.  12.5. 

A  number  of  extensions  of  this  problem  are  possible,  such  as  the  introduction 
of  an  influx  of  liquid  at  v  =  0  and  the  investigation  of  different  geometries  for  the 
wedge  shape  (see  [16]  for  a  detailed  description  of  the  previous  analysis  and  further 
extensions). 


Epilogue 


The  approaches  and  methods  described  in  this  book  have  long  histories  and  have  been 
employed  to  make  major  advances  in  research  on  many  challenging  fundamental 
problems.  These  approaches  to  modelling  and  problem  reduction  still  form  the  basis 
for  a  great  deal  of  current  research  on  more  advanced  problems.  As  one  example,  the 
quasi-steady  state  assumption  from  Chap.  10  is  used  as  the  basis  for  handling  large 
systems  of  complex  bio-chemical  reactions. 

Current  directions  in  modelling  of  physical  systems  have  also  put  a  new  focus  on 
complex  systems  [86]  and  multiscale  modelling  [33].  These  are  models  that  seek  to 
constructively  incorporate  layers  of  understanding  that  come  from  different  physi¬ 
cal  scales,  often  referred  to  as  micro-,  meso-,  and  macro-scales.  Examples  include 
(i)  improved  models  for  materials  properties  in  continuum  model  settings  that  are 
derived  from  microscale  models  at  the  atomic  scale  and  (ii)  systems  biology  models 
of  organs  or  entire  physiological  systems  building  from  models  of  cells  and  biochem¬ 
ical  reactions. 

One  of  many  active  forums  for  current  work  on  modelling  is  the  journal  Multiscale 
Modeling  and  Simulation  published  by  the  Society  of  Industrial  and  Applied  Math¬ 
ematics  (SIAM).  Many  multiscale  models  make  use  of  asymptotic  approaches1  to 
implement  matching  of  descriptions  at  different  scales  in  combination  with  numer¬ 
ical  computations  to  make  progress  on  fundamental  questions  in  a  broad  array  of 
application  areas.  We  hope  that  this  book  has  given  readers  a  strong  starting  point 
for  moving  on  to  such  advanced  and  challenging  problems. 

TW  and  MB 


including  boundary  layers  and  other  extensions  of  multiple  scales,  like  averaging  [73,  102]  and 
homogenisation  theory  [49]. 
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Appendix  A 

Trigonometric  Identities  and  Fourier  Series 


A  brief  summary  of  useful  trigonometric  identities: 


sin2  x  +  cos2  x  —  l 


sin  x  cosy  +  cos  x  sin  y  =  sin(x  +  y)  cos  x  cosy  —  sinx  siny  =  cos(x  +  y) 
sin x  cosy  =  ^  (sin(x  —  y)  +  sin(x  +  y))  sinx  siny  =  ^  (cos(x  —  y)  —  cos(x  +  y)) 

cos x  cos y  =  i  (cos(x  —  y)  +  cos(x  +  y)) 


sin  (2t)  =  2  sin  t  cos  t  cos(2t)  =  2  cos2  t  —  1 


sin  t  = 


1  —  cos  2 t 


9  1  +  cos  2t 

cos  t  —  - 


sin  t  = 


3  sin  t  —  sin  3t 


cos  t  = 


3  cos  t  +  cos  3^ 


sin  t  cos  t  = 


cost  —  cos3t 


2  sin  t  +  sin  3t 
sin  t  cos  t  = - 


All  trigonometric  identities  can  be  derived  from  successive  applications  of  the  for¬ 
mulas  for  the  sum  or  difference  of  angles  (x  ±  y)  and  further  algebra. 

The  need  for  these  identities  can  be  eliminated  by  using  Euler’s  formula,  el 0  = 
cos  0  +  i  sin  9,  to  replace  cosine  and  sine  by  their  complex  representations, 


elt  +  e 


cos  t  =  Re(Vr)  = 


-it 


sin  t  =  lm(elt)  = 


git _ g  it 


2  i 


then  all  results  follow  from  algebra  and  re-grouping  e±int  to  determine  coefficients 
of  cos  (At)  and  sin  (At)  terms. 
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Appendix  A:  Trigonometric  Identities  and  Fourier  Series 


A.l  Trigonometric  Fourier  Series 

Analogous  to  the  way  that  every  nth  order  polynomial  can  be  expressed  as  a  finite 
Taylor  polynomial,  the  trigonometric  identities  allow  every  nth  order  product  of  sines 
and  cosines  to  be  written  as  a  sum  of  sines  and  cosines.  Fourier  series  generalise  this 
to  represent  all  integrable  periodic  functions  in  terms  of  an  infinite  series  of  sines 
and  cosines. 

Expansions  of  given  periodic  functions,  fix)  on  —  L  <  x  <  L  can  be  written  as 


fix)  =  ^  +  ^an  cos  Cfx)  +^b„  sin(^x) 


OO 


oo 


2  ^ 

n=  1 


n=\ 


(A.  la) 


where  for  n  =  0,  1,  2, 


1  fL  1  fL 

an  =  —  /  f{x)  cos(^v)  dx  bn  =  —  /  fix)  sin(^v)  dx. 

L  J-l  L  J_L 


(A.  lb) 


The  Fourier  series  for  a  function/  will  converge  to  the  value  of  the  function  at  all 
points  where  fix)  is  continuous. 

The  Fourier  expansion  can  also  be  written  in  complex  form  as 


oo 


fix)  =  X! Cfi 


innx/L  _ 

e  cn  — 


n=—oo 


2-  f  fix)e-imx'Ldx.  (A. 2) 

J  — L 


The  Fourier  cosine  series  for  a  function  F(x)  given  on  0  <  v  <  L  can  be  derived 
from  the  general  Fourier  series  by  defining  fix)  to  be  the  even  extension  of  F(x)  : 

Fix)  0  <  x  <  L, 

F(—x)  —L  <  x  <  0 


fix)  = 


F(x)  =  y  +  ^an  cos i™x) 


OO 


n=  1 


2  fL 

an  =  —  Fix)  cos(^v)  dx. 

L  Jo 


(A. 3) 


Similarly,  the  Fourier  sine  series  for  a  function  F(x)  given  on  0  <  v  <  L  can  be 
derived  from  the  general  Fourier  series  by  defining  fix)  to  be  the  odd  extension  of 
Fix)  : 

' 

F{x)  0  <  x  <  L, 

—F(—x)  —L  <  v  <  0 


fix)  = 


OO 


Fix)  =  y sin(^x)  b„  =  — 


•L 


n=\ 


L 


ro 


Fix)  sin(^v)  dx. 


(A.4) 


Solutions  to  Selected  Problems 


Chapter  1 

1.1  Solving  the  ODE  for  m  first  yields  m(t)  =  moe~r .  After  substituting  m(t )  into 
the  equation  for  z(t ),  this  equation  can  be  integrated  twice  and  with  ^(0)  =  0  yields 
z(t)  =  (r  —  g)(et  —  t  —  1).  From  this  solution  we  can  see  that  lift-off  occurs  (z  >  0 
for  t  >  0)  if  r  >  g. 

1.2  For  (F8),  direct  integration  yields  A(t)  =  Ao  +  kt.  For  (F9),  separation  of 

variables  yields  A(t)  =  Aoe~kt  and  for  (1.10)  substitution  into  the  equation  for 
B(t )  yields  B(t )  =  +  Ao  —  Aoe~kt.  For  (1.12),  note  that  dA/dt  =  dB/dt  so 

B{t)  =  A (t)  +  c  where  from  the  IC’s  c  =  Bq  —  Ao,  then  substituting  into  the  ODE  for 
A,  separation  of  variables  yields  A  (t)  =  Ao(Bq  —  Ao)/(Bq  exp(— ^ £(Ao  —  Bo)t)  —  Ao). 

1.3  See  (10.12a). 

1.4  See  Exercise 4.5  and  set  F  =  0  in  (4.49). 

1.5  (a)  All  solutions  starting  from  ro  /  0  approach  either  v  =  ±1.  The  basin 
of  attraction  of  v  =  —  1  is  vo  <  0  and  all  solutions  having  x(t  — >  oo)  — >  1 
have  vo  >  0.  (b)  Second-order  equilibria  occur  where  /  =  0  has  a  double  root, 
f\x *)  =  1  —  3xl  ~  namely  v*  =  d=l/V3.  Substituting  these  v*  into /(v*)  =  0 
yields  k±  =  ±2^3/9.  Plotting  directions  on  the  phase  lines  shows  these  bifurcation 
values  correspond  to  the  changes  in  qualitative  behaviours. 


Chapter  2 

2.1  Using  the  hint,  we  can  express  the  problem  as 


dt 


e^0  dt 


f(x ,  t)  dx 
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Solutions  to  Selected  Problems 


We  are  taking  a  derivative  of  a  product  of  functions  with  the  second  factor  being  an 
integral,  hence  we  apply  Leibniz’s  rule  (with  respect  to  t)  to  it, 


K 


ti 

&Ja 


a+eh 


1 

fdx+  - 
h 


'a+eh  ftf  d(a  +  sh )  da 

—  dx  +f{a  +  sh,  t ) - f(a,  t ) 


a 


dt 


dt 


dt 


]) 


Now  we  are  ready  to  consider  the  limit  for  e  — >►  0.  We  will  expand  out  the  first  two 
terms  in  the  Taylor  series  for  s  — >  0,  using  Leibniz’s  rule  again,  to  take  the  derivative 
of  integrals  (with  respect  to  e), 


H 


s  \  h2 

1 


+ 


h 


h!  r  ra 

/  f  dx  +  sf(a,  t)  —  0 

_•/ a 

a  df  J  df  1 
dx  +  s  —  h 
dt  dt  x=a 


+(/+“^)L(* 


da  dh 
+  8 


dt 


) 


-  f(a ,  t ) 


da 

dt 


]) 


Eliminating  null  integrals,  fcl  gdx  =  0,  and  cancelling  terms  reduces  this  to 

kJ  Ct 


‘( 


sfh!  1 

—  +  - 
h  h 


df  7  rdh  7  df  da  2l  df  dh 


£ — h  +  sf - 1-  sh 


df  df  da 

- 1 - b  8 

dt  dx  dt 


dt 
df  dh 
dx  dt 


dt 


dx  dt 


+  eh 


dx  dt 


.V— <7^ 


x=a 


Hence  we  obtain  the  limit  as  the  convective  derivative  at  x  =  a(t). 


r  4/avg  df  df 

lim  - -  = - b  v  — 

dt  dt  dx 


X—Cl 


In  three  dimensions  it  can  be  shown  that 


/avg  (0  — 


If Id« )  dV 


lim  jhm  =  —  +  y  •  V/ 
D(t)^0  dt  dt 


x=a 


2.2  Expand  the  left-hand  side  of  the  conservation  of  momentum  equation  using  the 
product  rule 


3  (pv)  3  (pv2) 


dt 


dx 


dv  dp  dv  d(pv) 
P—  +  v— -  +  pv  — - b  v 


dt  dt 

dv  dv 
=  p—  +  pv—  +  V 


dx 


dt 


dx 


dp 

~dt 


+ 


dx 

3 (pv) 
dx 
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where  the  terms  on  the  second  line  vanish  by  virtue  of  the  continuity  equation  (the 
conservation  of  mass). 

2.3  (a)  Use  the  trigonometric  identity  cos  a  cos  b  =  2  cos  (^[<2  +  b])  cos  (^[<2  —  b]) 
to  re-write  p  as 


0 


p  =  2  cos  (  7y[(k  +  k  +  s)x  —  {co  ( k )  +  co{k  +  s))t]x 


)  COS  (  J 


[(&  +  £  —  k)x  —  (co(k  +  s)  —  co(k))t] 


) 


For  s  ->  0  the  Taylor  series  gives  co(k  +  s)  =  co(k)  +  co' (k)s  +  •  •  • ;  using  this,  for 
£  ^  Owe  get 

co(k  +  s)  —  co(k)  ~  co' (k)e,  co(k  +  s)  +  co(k)  ~  2 co(k) 


and  then  the  first  term  in  the  expansion  of  p  is 


0 


p  ~  2  cos  (kx  —  co{k)t)  cos  (  ls[x  —  co'(k)t ]  )  =  2  cos  (  k\x 


^  =  2  cos 


co(k) 


^  cos  (\s[x 


CO 


xm), 


Matching  terms  to  the  prescribed  form  identifies  the  phase  and  group  velocities  as 


cp{k)  = 


00  (k) 


Cgik)  = 


dco 

dk 


(b)  Substituting  p  =  cos  (Ur  —  cot)  in  the  PDE  yields 

Pt  +  Px  —  Pxxt  =  (co  —  k  +  00k  )  sin(&v  —  cot)  =  0 
and  forcing  the  coefficient  to  vanish  yields  the  dispersion  relation, 


co(k)  = 


1  +k2 


Similarly,  substituting  p  =  exp  (Ur  —  cot)  yields 


Pt  +  Px  -  Pxxt  =  (-<w  +  k  +  S)k 2)  exp  (fa  -cot)  =  0 


yielding  the  modified  dispersion  relation 


d)(k)  = 


1  —  k2 


2.4  (a)  Substituting  p  =  P(x  —  ct)  in  the  BBM  PDE  yields 
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(b)  Note  that  we  can  integrate  the  ODE  to  yield 


d  —  cP  +  P  +  3  P2  +  c 


d2P 
ds 2 


Now  considering  P(s)  =  Asech2  CBs) ;  for  |s|  — >  oo,  P  — >  0  and  similarly  all  deriv¬ 
atives  P\  P" , . . .  — >►  0.  Hence,  evaluating  the  ODE  for  1^1  — >  oo  we  can  conclude 
that  the  constant  of  integration  is  d  =  0, 


(1 


c)P  +  3P2  +  c 


d2P 
ds 2 


Substituting  in  the  sech2  solution  form,  after  some  algebra  the  ODE  gives, 
(4A  +  16cA£2  -  4cA)  cosh(2 Bs)  +  (24A2  +  4A  -  32cA£2  -  4cA)  =  0 


The  equation  is  satisfied  for  all  s  if  the  coefficients  (in  parentheses)  are  each  zero. 
The  first  coefficient  yields 


(1  +  c)  +  4  cB2  =  0 


* 


then  the  second  coefficient  is  obtained  from 


6A  +  1  —  8  cB 
Hence  the  soliton  is 

p(x,  t )  =  P(x  —  ct)  = 


c  — 


1  9 

-sech 


2 

C  —  1  9 

- sech 


2 


where  the  final  line  is  meant  to  be  of  the  wavenumber-frequency  form,  p  = 
G(kx  —  cot).  To  check  if  this  satisfies  the  modified  dispersion  relationship  found 
in  the  previous  exercise,  consider  if 


c 


k(c)  = 


5>(c)  =  V  c2  —  c, 
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are  parametric  equations  for 


k  \/(c  ~  l)/c  /c  —  1 

co(k)  =  - -r  =  — - —  =  cj - 

1  —  k2  1  —  (c  —  l)/c  V  c 


so  yes,  this  is  verified.  Also,  the  definition  of  the  phase  speed,  c  =  co/k  =  c  holds. 
One  of  many  special  properties  of  solitons  is  that  they  satisfy  dispersion  relations 
obtained  from  the  linearised  version  of  the  PDE  while  having  their  form  determined 
by  the  full,  nonlinear  equation. 

2.5  Using  the  given  forms,  the  third  equation,  1,  t)  =  —f  =  —A  cos (kx  —  cot) 
reduces  to  —coB(  1)  =  —A.  Similarly,  the  fourth  equation,  ft  =  cj)y  =  B'(\)  cos  (kx  — 
cot)  yields  coA  =  B'(  1).  The  second  equation  <py  =  B'( 0)  sin(^v  —  cot)  reduces  to 
=  0.  Finally,  substituting  0  into  <fixx  +  cpyy  =  0  yields 

d^B 

-k2B  +  — -y  =  0  =>►  B(y)  =  c\eky  +  C2e~ky . 

dyz 

Applying  the  boundary  condition  /^(O)  =  0  and  B'(  1)  =  coA  to  the  general  solution 
of  the  ODE  yields 


c\k  —  c^k  =  0, 


ci  =  c2 


coA 

2k  sinh(^)  ’ 


then  B(  1)  =  A/ co  yields  the  dispersion  relation, 

coA  coA  A 

— ; - cosh(/0  =  —  tanh  (k)  =  —  =>- 

k  sinh(^)  k  co 

2.6  (a)  The  initial  value  problems  for  the  characteristic 


co(k)  =  yjk  tanh  (k). 
ODEs  are 


dX  0  dP 

—  =  e2t  X(0)  =  A,  —  =p  +  x  +  t  P(  0)  =  cos  (A) 

dt  dt 


First  solving  for  X  yields  X (t)  =  A  +  ^  ( e2t  —  1)  which  can  then  be  plugged  into  the 
equation  for  P  to  yield 

P(t)  =  (A  +  cos(A))er  -  A  -  t  -  ±(1  -  e2t) 

On  each  characteristic  curve,  we  can  invert  X(t,  A)  to  get  A  =  x  +  i  [1  —  e2t],  then 
substituting  into  P  yields 


p(x,  t)  =  +  ^[1  —  e2t]  +  cos(v  +  ^[1  —  e2t ])^  er 

-(x+i[l-e2r])-t-i(l-e2f). 
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(b)  The  characteristic  ODEs  are 


dX 

dt 


=  X  +  4, 


dP 

dt 


=  -IP. 


The  general  solutions  of  these  equations  are 

X(t)  =  c\e’  -  4,  P(t )  =  c2e~2’. 


The  side  conditions  give  data  in  two  parts. 

For  A  >  0,  we  have  X(0)  =  A,  P(0)  =  e~A  at  t  =  0.  Applying  this  to  the  general 
solution  yields 

X(t )  =  (A  +  4)e‘  -  4,  P(t)  =  e~A~2t. 

Inverting  x  =  X(t,  A)  yields  A  =  (x  +  4)e~  -i  -  4  >  0  and  substituting  this  into  P{t) 
yields 

p(x,  t)  =  exp  (4  —  (x  +  4)e~r  —  It)  for  v  >  4(e 1  —  1). 

The  second  part  of  the  data  is  given  at  v  =  0  =>-  X(T)  =  0  with  P(T)  =  cos (T) 
for  t  >  0  =>•  T  >  0.  Applying  these  conditions  to  the  general  solution  yields 

X(t)  =  4e‘~T  -  4,  P{t)  =  cos(7>“2r+27\ 


Inverting  x  =  X(t,T)  yields  T  =  t  —  ln(l  +x/4)  >  0  and  substituting  this  into  P(t) 
yields 

p(x,  t)  =  cos  ^  —  In  ^1  +  j  ^1  +  forv  <  4(et  —  1). 

2.7  (a)  Expanding  out  the  product  rule  and  using  the  given  velocity  yields  the  PDE 


dP  .  2 


dt 


4- x  e 


=  —2xe  3rp 


dx 


and  consequently  the  characteristic  ODEs 


dX 

dt 


=  X2e~3r,  X(0)  =  A,  for  1  <  A  <  2. 


and 


dP 


-2 XPe~3t 


dt 


Pi  0)  =  1. 
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The  problem  for  X(t)  yields 


X(t,A) 

and  the  problem  for  P(t )  yields 

Pit,  A) 


Inverting  v  =  X(t,  A)  for  A  yields 


and  substituting  this  into  P(t,  A)  yields 

p(x,t)  =  -  1]  —  l)  . 

2.10  (a)  Substituting  in  the  definitions  of  p,  q  in  the  first  equation  yields  <fitt  —  c 2 
<fixx  =  0  (the  wave  equation)  and  <fixt  —  <fitx  =  0,  always  true  by  the  identity  of  mixed 
partial  derivatives  of  smooth  functions. 

(b)  The  system  can  be  written  as 

a_(p\>(  °  ~c2\  a  (p\  =  (°\ 

3t  \q)  V-1  0  )  3  x\q)  V0/' 

^ - V - ' 

M 


Following  the  approach  given  in  Sect.  2.4,  we  obtain  the  wavespeeds  from 


1 

>• 

1 

1 — ‘ 

1 

(N 

1 

>  X  =  =hc. 


The  corresponding  eigenvectors  are  then 

A]  =  c  W\  (x  —  ct)  =  p(x,  t )  —  cq(x,  t ) 

A 2  =  —  c  W2(x  +  ct)  =  p(x,  t )  +  cq(x ,  t) 

(b)  Then  the  solutions  can  be  expressed  in  terms  of  W\ ,  W2  are 

1  1 

p(x,  t)  —  -  (W\(x  -  ct)  +  W2(x  +  ct))  q(x,  t)  —  —  (W2(x  +  ct)  -  W\  (x  -  ct)). 
2  2c 
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(c)  We  can  relate  A,  B  to  W\ ,  W2  from 

30  1  , 

p  =  —  =  -  (W \(x  —  ct )  +  W2(x  +  ct ))  =  —cA  (x  —  ct )  +  cB  (x  +  c£) 

3?  2 

30  1  , 

<7  =  —  =  —  (W2(x  +  ct)  —  W\(x  —  ct))  =  A  (x  —  ct)  +  B  (x  +  ct) 
dx  2c 

Both  of  which  yield 

/  1  /  1 

A  (x  —  ct)  = - W\  (x  —  ct)  B  (x  +  ct)  =  — W2(x  +  ct). 

2c  2c 

(d)  At  t  =  0  the  initial  condition  0(A,  0)  =  fix)  implies  (px(x,  0)  =  q(x,  0)  =f'(x) 
and  <pt(x,  0)  =  p(x,  0)  =  g(x),  and  consequently 

Wi(x)  =  g(x)  -  cf\x)  W2(x)  =  g(x)  +  cf\x) 

Hence,  from  part  (c) 


A'(x)  =  -2  (g(x)  -  cf(xj) 


A{x)  =  /"  A'(iUB  +  a  =  m-m 


l 


l 

2 c 


*x 


g(x)  dx  +  a 


B\x)  =  —  (g(x)  +  cf'(x)) 


'X 


B(x)  =  I  B'(x)  dx  +  b  =  A2 — f()_\ - I  g(x)dx  +  b, 

f0  2  2c  Jo 


where  a,  b  are  constants  of  integration. 

So  using  <f>(x,  t)  =  A(x  —  ct)  +  B(x  +  ct), 

J>=\  (f(x  ~  ct)  -/( 0))  ~  f  f  g(x)  dx  +  a 

2 c  Jo 

^  rx+ct 

+  \  {fix  +  ct)  -/( 0))  +  —  /  g(x)  dx  +  b 

2 c  Jo 

^  rx+ct 

=  \  (fix  +  Ct)  +f(x  -  ct))  +  —  /  g(x)  dx+(a  +  b  -/( 0)). 

2c  Jx_ct 

Checking  the  initial  condition,  0(x,  0)  =f(x). 


1 

<P(x,  0)  =  \if(x)  +f(x))  +  —  J  g(x)dx+(a  +  b  -f( 0)), 


=0 
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so  the  constants  of  integration  should  be  chosen  so  that  a  +  b  — /( 0)  =  0,  yielding 
the  D’Alembert  solution. 


2.12  The  conservation  law  pt  +  q(p)x  =  0  can  be  re-written  as  pt  +  qf(p)px  =  0, 
from  which  we  can  write  the  characteristic  equations  as 

dP 

—  =0  =>>  P  =  f(A)  =  constant 

dt 

—  q  (P)  =*  X  =  q  (f  (A))t  +  A 

dt 

(a)  The  two  characteristic  curves  will  intersect  when  X(top,Ao)  =  X(top,A\), 
namely 


q'(f  (Ao))4,i  +^o  —  q'  (f  (A\))tQi  +Ai 


^0,1  =  - 


A\  ~Aq 


q'(f(A\))  —  q'(f(Ao)) 


and  subsequently  using  this  value  of  top,  xo,i  =  q'(f(Ao))top  +  Ao  (or  equivalently 
in  terms  of  A\). 

(b)  Minimising  top  over  all  possible  Aq,  Ai  to  get  4,  consider  A  i  =  Aq  +  s. 


4  =  mm  — 


A\  -  Ao 


=  mm  — 


AoAi  q'(f  (Ai))  -  q'(f  (A0))  A0,e  q'(f(A0  +  e))  -  q'(f(A0)) 


Taking  the  limit  £  ^  0  and  recalling  the  limit  definition  of  the  derivative, 


>  0. 

2.13  (a)  The  characteristic  ODEs  are  dX/dt  =  P,  dP /dt  =  0,  as  in  (2.41).  Applying 
the  given  initial  conditions  yields  the  parametric  solutions 


4  =  min 


1 


*  _  q"(f _ 


X(t,  A)  =  (9  —  A2)/  +  A,  P(t,A)=9-A2  \A\<3 
X(t ,A)=A,  P(t,A)=  0  |A|  >  3 


(b)  To  invert  X(t,  A)  =  v  for  the  nontrivial  part  of  the  solution,  we  recognise  it  as  a 
quadratic  equation  for  A, 


A2t  —  A  +  (x  —  9 1)  =  0 


A  = 


1  ±  vT  -  4 (xt  -  9 12) 


It 
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then  we  can  substitute  into  P  to  obtain 

For  more  general  initial  conditions,  p(x,  0)  =f(x),  for  the  inviscid  Burgers’  equation 
we  have  X  =  Pt-\-  A  with  P  on  a  characteristic  curve,  so  we  can  still  write  A  =  x  —  Pt 
to  obtain  an  implicit  equation  for  the  solution,  p  =  f(x  — pt ),  here p  =  9  —  (x  —pt)2, 
to  yield  the  equivalent  form 


2 xt  —  1  zb  V36 12  —  4 xt  +  1 
pto  0  = - Wl - 

See  below  for  a  graph  of  this  multi-valued  solution  at  time  t  =  1 . 


X 


(c)  The  result  of  Exercise  2. 12  gives  that  the  time  at  which  the  shock  forms  is  deter¬ 
mined  by  the  initial  conditions, 

4  =  min  ( 4  =  min  ( 

x  \  f  (x)  /  jce[— 3,3]  \  2 X  J  6 

and  the  shock  will  first  form  at  v*  =  3. 

(d)  The  Rankine-Hugoniot  equation  for  the  inviscid  Burgers’  equation  (2.49)  and  the 
result  from  part  (c)  give  the  ODE  problem  for  the  shock  position  as 


dxs  2  xst  —  1  +  y/3  6t2  —  4  xst  +  1 

dt  At2 


xs(  1/6)  =  3, 


where  the  solution  ahead  of  the  shock  is  p+(x,  t)  =  0  and  the  solution  behind  the 
shock,  p-  (. x ,  t)  is  given  by  the  solution  above  with  the  positive  sign  on  the  square  root. 
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x 


(e)  The  inviscid  Burgers  equation  is  a  conservation  law  for  the  integral,  f  p  dx , 

P,  +  (V)i  =  o  =>  (//>*)=» 

Hence  the  conserved  value  is  set  by  the  area  under  the  initial  condition  f  (9  —  x2)  dx  = 
36.  Using  the  form  of  the  solution  with  the  shock,  the  equation  that  f  p(x,  t)  dx  =  36 
becomes  (for  t  >  1/6,  after  the  shock  forms), 


^  2 xt  —  1  +  V3 6t2  —  4 xt  +  1 

2fi 


dx  =  36. 


Evaluating  the  integral  yields  an  implicit  algebraic  equation  for  xs(t), 


1 


(1  -  4 xst  +  36/ 2)3/2  -  6xst  +  216 /3  +  54/2  +  l)  =  36. 


Chapter  3 

3.1  (a,  b)  Both  approaches  give 


-y"  +  \y  +  x2  =  0. 
xz 

Multiplying  across  by  x2,  this  equation  can  be  recognised  as  an  inhomogeneous 
Cauchy-Euler  equation,  x2y" —ky  =  —x4,  solvable  in  terms  of  a  sum  of  homogeneous 
and  particular  solutions. 

(c)  For  k  =  0  the  general  solution  is  y(v)  =  v4/12  +  c\x  +  C2.  Applying  the 
boundary  conditions  yields  c\  =  — 1/12,  C2  =  1. 

(d)  For  k  =  2  the  general  solution  is  y(x)  =  v4/10  +  c\x2  +  C2/X.  Applying  the  first 
boundary  conditions  yields  c\  =  9/10,  C2  =  0.  The  condition  y(0)  =  1  cannot  be 
satisfied  because  unless  C2  =  0  that  term  would  diverge  and  the  other  terms  in  the 
general  solution  vanish  for  v  =  0. 
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3.2 


(/Q2 

0  +  (/)2)3/2 


dx 


(fep2 

(i +b2)3/2 


dx  >  0, 


where  52/*  will  be  strictly  positive  unless  hr  (x)  =  0,  but  from  the  boundary  conditions 
on  y,  the  boundary  conditions  on  h  are  h(0)  =  h(l)  =  0.  Hence  h!  =  0  only  if  h  =  0. 


3.4  (a)  The  x(t)  Euler-Lagrange  equation  can  be  simplified  to 


x"(y')2){\  +  4k2 (x2  +  y2))  +  4 k2((x)2  +  (y')2)(x'y  -  xy')y  =  0. 


The  y  Euler-Lagrange  equation  takes  the  same  form  after  interchanging  x  y. 

(b)  The  Euler-Lagrange  equation  for  y(x)  can  be  reduced  to 

„  4fc2(l  +  (/)2)(V-y) 

1+4  k2(x2+y2) 

(c)  For  (ii),  since  z  is  constant  on  the  semicircle,  the  distance  is  just  the  arclength  of 
the  semi-circle,  Ju  =  n  for  all  k. 

For  (i),  we  have  y  =  0  and  z  =  kx2,  hence  the  arclength  can  be  calculated  from 

p  1  _  _  j 

Ji  (k)  =  /  vT-b  4 k2x2  dx  =  VTT  4k2  H - arcsinh(2k). 

J- 1  2  k 

For  k  =  0,  Ji  (0)  =  2  giving  the  length  of  the  straight  line  in  the  xy  plane,  while  for 
k  >  0  Jt  is  an  unbounded  monotone  increasing  function  of  k,  Ji(k  — >  oo)  — >  oo. 
Let  k*  be  the  value  of  k  for  which  //(£*)  =  tv.  Then  for  0  <  k  <  k*,  the  straight-line 
path  will  be  shorter  than  the  semi-circle  detours.  But  for  k  >  k*,  the  semi-circle  is 
shorter. 

3.5  (a)  After  the  substitution,  the  action  in  terms  of  0  (, t )  is 


^ml2(0')2  +  mgi  cos  0  dt 


and  from  (3.21)  applied  to  Q(t)  the  resulting  Euler-Lagrange  is  the  equation  of  the 
pendulum, 


0"  +  -  sin  0  =  0. 

£ 


(b)  See  Exercise  3.27  for  the  Euler-Lagrange  equation  for  x(t). 
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3.6  Substituting  the  parametric  equation  for  x,  y  into  the  general  Lagrangian,  L  = 
\m{{x')2  +  (y')2)  -  mgy  yields 

L  =  \ml2  ( 0f )2  +  mgl  cos  0  —  am  (too6!  cos  (cot)  sin  6  —  g  sin  (cut))  +  \ma2co2  cos2  {cot) 

Applying  (3.21)  then  yields  the  (vertically-oscillated)  parametrically-driven  pendu¬ 
lum  equation, 


0"  +  -  sin  (9  = - sin(<wO  sin0. 

i  i 

3.7  (a)  Starting  from  H  =  —L  +  y'dyL , 


dH 

dt 


d  (  ,dL 

—  (  - L  +  y '  — 
dt  \  dy ' 

-( 


) 


9L  dL  dy  dL  dry 

—  +  —4  + 


dL 

~dt 


dy  dt 
dL 


y 


_dy 


d 

dt 


) 

(i)] 


dyr  dt2 


dry  dL  t  d 
dt2  dyr  dt 

=  0, 


where  the  final  step  makes  use  of  the  assumption  that  dtL  =  0  to  leave  —  y'  times  the 
Euler-Lagrange  equation.  Hence  Hf  =  0  and  H  is  a  constant,  justifying  (3.59). 

(b)  Writing  the  Lagrangian  as  L  =  T(t,y,yf)  —  V(t,y)  the  condition  that  the  Hamil¬ 
tonian  is  the  total  energy  is 


,dT  ,dT 

H=-T  +  V  +  y' —  =  E  +  V  =>  y  —  =2  T 

y  dy'  y  dy' 

This  is  a  first  order  separable  ODE  for  T  as  a  function  of  y'  and  yields  T(t,y,yf)  = 
A(t,  y)(y')2  where  A(t,y )  is  any  function  of  t  and  y. 

3.8  (a)  At  t  =  0,  the  mass  starts  at  v  =  0,  y  =  1  from  rest,  v  =  0,  hence  the  initial 
energy  is  Eq  =  mg.  Equating  the  energy  at  later  times  with  Eo  yields 

\mv2  +  mgy  =  mg  =>■  v(y)  =  V2g(l  -  y) 

Consequently  the  functional  for  the  time  of  travel  is 


1  +  (yO2 

2g(l-y) 


dx 


L(y ,  y) 


l  +  (Vg 

2g{l-y) 
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(b)  The  Euler-Lagrange  equation  for  y(x)  can  be  simplified  down  to 


1 

2 


( 


1  +  (y')2\l/2  d( 

(1  -  y)3  /  dx  ^ 


_ f ) 

\/(l  +  (/)2)(l  -y)J 


but  this  is  a  difficult  looking  equation,  so  we  turn  to  a  different  approach. 

(c)  The  Beltrami  identity  (3.59)  is  applicable  since  L  does  not  explicitly  depend  on 
x,  so 


i  +  (/)2 


GO 


/ 12 


2sC  -  y)  s/2g{\  +  -  y) 

1 

=  c 


\/2g(i  +  (j,)2)(i  —  y) 


At  x  =  0,  y  =  1  and  to  make  C  finite,  the  slope  would  have  to  diverge,  /  — >•  — oo, 
so  this  form  is  indeterminate.  At  x  =  1,  we  have  y  =  0,  so  C  =  1  /J2g(  1  +  /(l)2), 
but  we  don’t  know  /(l).  Still,  we  can  write  a  singular  ODE  problem  for  y(v), 


dy 

dx 


1 


2gC2(l-y) 


y(0)  =  i,  y(i)  =  o. 


(d)  Substituting  the  parametric  equations  of  the  cycloid  (3.60)  into  the  ODE  gives 
a  necessary  relation  between  C  and  k,  1  =  AgC2k.  The  initial  conditions  at  0  =  0, 
x(0)  =  0  and  y(0)  =  1  are  satisfied  automatically  for  any  k.  The  final  state,  x  =  l 
and  y  =  0  at  some  6=6*  yield  two  coupled  equations  for  finding  (k,  6*): 


k(6 *  —  sin#*)  =0  1  —  k(  1  —  cos  6*)  =  0. 


3.10  Following  the  approach  given  in  Sect.  3.4.2  with/ (v)  =  l+(x—  l)2,y  =  y*-\-sh 
and  b  =  b*  +  sc ,  we  obtain  the  first  variation  as 


y"hdx+  jy'*(b)2c. 


The  boundary  condition  at  the  origin  determines  that  h( 0)  =  0.  Expanding  out  the 
boundary  condition  y(b)  =f(b)  yields 


y*(b *)  =f(b *),  y*(b*)c  +  h{b*)  =f\b*)c 
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consequently,  c  =  h(b*) / (f! (b*)  —  (/?*)).  Using  this,  the  first  variation  can  be 

re-written  as 


- ^2 - )  h(K) 

2  (f'(b*)-yimj 


There  are  two  possible  choices  for  natural  boundary  conditions  that  would  make 
the  boundary  term  vanish  for  all  possible  choices  of  h(b):  either  yr*(b*)  =  0  or 
(after  some  algebra)  y*(b*)  =  2 f'(b*)  (we  will  explore  both  possibilities).  Subject 
to  natural  boundary  conditions,  applying  the  fundamental  lemma  to  the  critical  point 
condition  8J*  =  0,  we  obtain  the  Euler-Lagrange  equation  y"  =  0.  After  applying 
the  boundary  condition  y(0)  =  0,  viable  solutions  are  of  the  form  y*(x)  =  Ax.  The 
boundary  condition  y*(b*)  =  0  would  yield  A  =  0,  but  this  option  must  be  rejected 
since  the  solution  would  not  reach  the  curve  y  =f(x).  Hence,  using  y*  (b*)  =  2f' (b* ) , 
we  get  A  =  4 (b*  —  1)  and  then  y*(Z?*)  =  f(b*)  yields  the  quadratic  equation, 
3 bl  —  2b*  —  2  =  0  and  the  solution 


4  i-  1  +  V7 

y*(x)  =  -(>/7  -2)x  0  <x  <  — - — . 

3.12  Beginning  by  substituting  y  =  y*  +  eh  and  expanding  for  e  — >  0  yields 

J  =  J*  +  £  f  hfy"f  +  y'*h"'  —  240 xh  dx  +  •  •  • 

Jo 

Using  integration  by  parts  to  get  the  derivatives  off  of  the  K s,  we  arrive  at  a  form 
where  the  fundamental  lemma  can  be  applied  to  critical  point  condition,  yielding  the 
ODE, 

-2/"  -  240v  =  0. 


Note  that  all  of  the  boundary  terms  vanish  thanks  to 

)/(0)  given  =>-  h'( 0)  =  0  y"(  1)  given  =>•  h"(  1)  =  0 
yr//(0)  given  =>►  h"'( 0)  =  0  y(l)  given  =>•  h(  1)  =  0 


The  ODE  can  be  integrated  directly  to  give  the  solution  as  a  polynomial,  after  applying 
the  four  boundary  conditions,  the  final  solution  is 

y*(x)  =  —x5  +  10x2  —  4. 
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3.13  Beginning  by  substituting  y  =  y*  +  sh  and  expanding  for  s  — >  0  yields 


J  —  J±  £ 


'X 


2 y*h'  +  (1  -  2x)  j  2 y*(t)h(t)dt 


dx  T- 


Applying  integration  by  parts  yields  the  first  variation  as 


(= 


2 y*h  +  (x-  x2) 


2 y*h  dt ^  |  -  J  [ly'l  +  2(x  —  x2)y,fj  hdx 


The  second  boundary  term  vanishes  automatically  at  v  =  0  and  x  =  l  due  to 
the  x(l  —  x)  factor.  The  first  boundary  term  vanishes  under  the  natural  boundary 
conditions 


2yi(0)ft(0)  =  0 


>  04(0)  =  0  or  h( 0)  =  0  y*(0)  =  A} 


and 


2yi(l)*(l)  =  1 


>  04(1)  =  0  or  Ml)  =  0  =*  y*(l)  =  B } 


3.14  Beginning  by  substituting  y  =  y*  +  sh  and  expanding  for  s  ->  0  yields 


l 


J  =  J*+e\  j  2^34'Zi"  +  +  yiC0)/i'(l)  +  y;(l)*'C0)  )  + 


■) 


Integrating  by  parts  yields  the  first  variation  as 


(2y*(lK'(l)  +y,(0))  A'(l)  +  (— 2y*(0)y"(0)  +  /*(1))  h\ 0) 

+  J  *vA  +  (vJ2  +  hdx 


Hence  the  Euler-Lagrange  equation  is 


d2y*\2  dz  (  dzy*\ 

+  2  ( y*  ^2  )  +  2y*  —  0 


/  dzy*  \ 
\  dx 2  / 


dx 2 


and  the  further  natural  boundary  conditions  needed  are 


{2y*(l)y*(l)  +  y*(0)  =  0  or  h\  1)  =  0 


*  34(1)=  A} 


and 


{-2y*(0)/;(0)  +34(1)  =  0  or  h\ 0)  =  0  =*  y'#(0)  =  B } 
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3.15  (a)  Using  v  =  c/n(x),  functional  (3.6)  becomes 


T(y )  =  - 
c 


1  [  n(x)J^dx 

c  Jo  v 


L(X,y')  =  1  +  (j')2 


Applying  (3.21)  to  this  Lagrangian  yields 


0- 


d  (  n(x)y ' 


dx  vvi  +  t/)2 


=  0, 


which  can  be  integrated  once,  and  applying  the  initial  condition  yields 


n{x)y' 


yt + (/)2 


=  C  = 


*(0) 

V2 


dy  n( 0) 

dx  y/2n(x)2  —  n(  0)2 


(b)  Yes,  sin  0i  =  ^2  sin  #2- 

(c )  L(x,y,yf)  =  n(x,  y)J  1  +  (/)2 '/c  yields  the  Euler-Lagrange  equation 


y/i  +  (y O2 


d  (  n(x,  y)y' 


dy  d*WTTW2 


=  o. 


3.16  (a)  The  total  kinetic  and  potential  energies  are  given  by 


1—2j~  17  ^  \El\ldx 


T  =  I  -puj  dx  V  = 


ro 


2  x 


(b )  L  =  T  —  V  then  the  action  is 


*t\ 


‘t\  pi 


J  =  j  Ldt  = 

to  J  to 


dxdt 


(c)  Substituting  u(x,  t)  =  u*(x,  t)  +  sh(x,  t)  and  expanding  for  s  — >  0  yields 

pt\  pi 

J  =  J*+e  /  (pu*tht  -  EIu*xxhxx)  dxdt  H - 

d  to  do 


Splitting  the  0(s)  term  into  separate  integrals,  we  interchange  the  order  of  integration 
on  the  first, 

nt\  pt\  pi 

pu*tht  dt  dx  —  /  /  EIu*xxhxx  dx  dt 

j  J  to  do 
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Applying  integration  by  parts  with  respect  to  the  inner  integrals  respectively  yields 


The  perturbation  in  the  solution  at  initial  and  final  times  t  =  to,  t\  can  be  assumed 
to  be  zero  h(x,  to)  =  h(x,t\)  =  0  to  eliminate  the  first  boundary  terms.  Eliminating 
the  second  boundary  term  defines  natural  boundary  conditions  to  remove  the  u*xxhx 
and  u*xxxh  terms  at  each  time, 

{u*xx( 0,  0=0  or  hx( 0)  =  0  =>►  m**(0,  t)  =  A(0} 


and 


{w*xxc(0,  0  =  0  or  h( 0)  =  0  =>►  w*(0,  t)  =  B(t)} 


and  similarly  for  the  boundary  conditions  at  v  =  1 . 

Applying  the  fundamental  lemma  to  ff  (pu*tt  +  EIu*xxxx)h  dx  dt  =  0  yields  the 
beam  equation  PDE  as  the  Euler-Lagrange  equation, 


P 


+  El 


d4u 

dx4 


3.17  Substituting  u(x,  y )  =  u*(x,  y)  +  sh(x,  y)  and  expanding  for  s  — >  0  yields 


J  —  J*  +  £ 


u*xhx  T 


hy^j  dA  T  •  •  • 


The  first  variation  can  be  written  in  vector  form  as  ff  kVu*  •  VhdA  then  matching 
to  the  product  rule  with  g  =  kS7u*  and /  =  h ,  it  can  be  expressed  as 


( hkWu *)  dA 


V  •  ( kVu*)hdA 


Applying  the  divergence  theorem  to  the  first  integral  changes  it  to  an  integral  on  the 
boundary 


8J*  = 


h(kn  •  VuA  ds 


V  •  (kWuAhdA 
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Eliminating  the  boundary  integral  leads  to  two  choices  for  natural  boundary  con- 
ditions:  (i)  homogeneous  Neumann  conditions,  n  •  Vu  =  7^  =  0,  or  (ii)  specified 
Dirichlet  conditions,  h( d D)  =  0  =>►  u(dD )  =/(3D)  given.  Subsequently,  the  fun¬ 
damental  lemma  can  be  applied  to  the  remaining  double  integral  to  yield  the  elliptic 
PDE 


V  •  (jfcVw)  =  0. 


3.19  (a)  The  augmented  Lagrangian  for  this  problem  is 


&=i  +  (y'f  -  a.  (f  -  f )  , 

then  the  Euler-Lagrange  problem  is 


d2y 

-Z+\y  =  0,  y(0)=0,  y(n)  =  1, 


=  80. 


The  ODE  is  a  linear-constant  coefficient  equation;  it  breaks  down  into  two  cases, 
depending  on  the  value  of  A  ^  0. 

If  A  =  —a2  <  0  then  the  solution  of  the  boundary  value  problem  for  the  ODE  is 


y-(x) 


sinhCav) 

sinh(a7r) 


If  A  =  a2  >  0  then  the  solution  is 


y+(x)  = 


sin  (co:) 
sin(o'Tr) 


In  the  two  cases,  the  integral  becomes 


I±(a)  = 


2ai x  —  sin(2o'7r) 
4a  sin2 (an) 


sinh  (2a  tv)  —  Ian 
4a  sinh2  (an ) 


I+(a)  >  Tt  /  3  and  has  multiple  solutions  for  any  value  of  the  constraint  greater  than 
7r /3  ~  1.047.  Meanwhile  0  <  /_  (a)  <  Tt / 3  and  is  monotone  decreasing,  so  there  is 
a  single  solution  for  each  value  of  the  constraint. 
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V^i 


3.20  (a)  Recall  Green’s  theorem, 


P(x,  y )  dx  +  Q(x,  y)  dy  =  [  f  ( ^  ^  I  dA 

3D  JJD\dx  dy 


) 


We  have 


1 


ro 


\x(t)y\t)  —  y{t)xr {t)\  dt  =  (p  ^ xdy  —  ^ y  dx . 


We  can  identify  P  =  —  ^y,  Q  =  ^x,  therefore, 


1 


ro 


\x(t)y'(t)  —  y(t)x' (t)\  dt  =  //  dA  =  Area 


(b)  The  Euler-Lagrange  equations  are 


“(:  •  ,  ^  V"  (  -  ■  lv'  'l 

dt  V  +  (/)2/  dt  V  y co2 + (jo2  / 


=  0 


(c)  Integrating  once  yields 


y  + 


Xx' 


vWTW 


=  C]  —  V  + 


Xy' 


y  (x')2  +  (y')2 


=  C2 


and  can  be  re-arranged  to  yield 


J\2  ,  /,,/\2 v  2 


(c\  y)2  +  ( C2  +  x)2  =  a.2  -4  ^  ^-4  )  =  a2 


2/(^r  +  (y)- 

(vO2  +  (/)2 


) 
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Hence  we  have  the  equation  of  a  circle  with  radius  X  =  P/(2n)  and  centre  (—C2,c\). 
Having  the  origin  on  the  circle  means  that  c\  +  c\  —  X2,  hence  we  can  take  c\  = 
X  sin  6 ,  C2  =  X  cos  0  for  any  0 . 

3.23  (a)  The  augmented  functional  is 


A  (24  -xy  —  5) 


dx. 


(b)  Substituting  y(x)  =  y*(v)  +  sh(x)  and  expanding  for  s  — >  0  yields  the  first 
variation  as 

81  =  J  12  y*h  +  2  x2y'*h'  —  24xhdx 

Examining  the  boundary  condition  y( 2)  =  y(l)  +  3  gives  that  h( 2)  =  h(  1), 
and  applied  to  the  boundary  terms  produced  by  integration  by  parts,  = 

2(4/* (2)  —  y*(l))h(l),  hence  we  determine  the  natural  boundary  condition 

4y*(2)  —  3^(1)  =  0 


and  then  the  Euler-Lagrange  equation  is 

12y  —  (2  x2y')'  —  24Xx  —  0  =>►  x2y"  +  2  xy  —6 y  =  —  \2Xx. 

This  is  an  inhomogeneous  Cauchy-Euler  equation  with  solution 

O  _ Q 

y  =  c\x  +  C2X  +  3Xx 

(c)  Substituting  into  the  boundary  condition,  the  natural  boundary  condition  and  the 
integral  constraint  yields  three  equations  for  c\,  C2,  X, 


1  9 

3ci - C2  +  3X  =  3,  14ci  H — C2  +  9X  =  0,  90ci  +  12c2  +  168A  =  5. 

8  4 

3.24  The  augmented  functional  for  the  constrained  problem  is 


*2  ^y2  (l/)^ 

1=1  — - ^ —  ^(y  +  3)  dx 


l  X 


X' 


The  resulting  Euler-Lagrange  equation  for  y  =  y*  (x)  is 

x2y"  —  3  xy'  +  3y  =  ^Xx5 . 
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The  solution  of  this  inhomogeneous  Cauchy-Euler  equation  is 

3  ^5 

y  =  c\x  +  C2X  +  —~x  . 

16 

Applying  the  boundary  conditions  and  the  integral  constraint  determine  c\  =  11, 
6*2  =  —8,  X  =  16. 

3.26  (a)  The  Hamiltonian  is 

=  L  Xf  =  4x 2  3 xu  T~  u2  H-  X(3x  H-  u). 


The  PMP  yields 


dx  dX 

—  =3  x  +  u,  —  =  — 8v  —  3u  —  3X,  3x  +  2u  +  X  =  0. 
dt  dt 


Eliminating  X  reduces  the  problem  to  a  system  of  two  linear  ODEs  in  x,  u , 


dx 

dt 


=  3x  +  u 


du 

dt 


=  —  5x  —  3  u 


having  the  general  solution 


x(t)  =  c\e2t  +  c^e  2t  u(t )  =  —  c\e2t  —  5c2e  2t 


Substituting  these  into  the  Hamiltonian  yields  =  16e*iC2.  The  Hamiltonian  is 
indeed  a  constant,  and  to  enforce  Jf?  =  0  we  need  either  c\  =  0  or  C2  =  0.  The 
possibility  of  c\  =  C2  =  0  can  be  excluded  because  it  would  give  the  trivial  solution 
and  could  not  satisfy  the  initial  condition  v(0)  =  2.  Consider  C2  =  0,  then  c\  =  2; 
this  yields  a  monotone  increasing  function  for  x(t)  >  2  and  could  never  satisfy  the 
target  condition  x(T)  =  1.  Hence  c\  =  0  yielding  x(t)  =  2e~2t  and  the  target 
condition  determines  T *  =  ^  In  2. 

(b)  Everything  remains  the  same,  but  since  T  =  \  is  imposed,  we  lose  the  natural 

boundary  condition,  that  =  0,  so  imposing  v(0)  =  2  and  x(^)  =  1  determine 
the  constants,  and 


2e  —  e1/2 
e  —  l 


) 


u(t)  =  — 10<?~2;. 


For  this  solution,  the  value  of  the  Hamiltonian  is 


M>  =  - 


16(e  —  2)(2e1/2  —  1) 

(e-  l)2 


-7.21  <  0. 
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This  illustrates  the  maximum  in  PMP,  at  the  optimal  solution  (having  the  optimal 
stopping  time),  the  value  of  the  Hamiltonian  will  be  maximised;  in  general  <  0. 

Chapter  4 

4.1  For  cases  (i,  ii,  iii)  the  scaled  problem  is 


d2y 

dt2 


ni 


(1  +  n2y)2 

4nGpEF{ET2 

3L 


>-(())  =  n3  /(0)  =  -n4 

L  2  V0T 

~  Re  n'  =  Z  ~ 


(i)  Starting  with  774  =  1  we  get  L  =  VoT  then  773  =  1  sets  L  =  2,  T  =  2/ Vo, 
/7i  =  87tGpeRe/(3V?)  =  e  0 


d2y  _  s 

dt2  ~  (1  +  n2y)2 


v(0)  =  1  /(0)  =  —  1 


(ii)  Starting  with  /7|  =  I  we  get  L  =  4ttGpeReT2/3  then  773  =  1  sets  T 
t/3/(2^GpeRe)  and  L  =  2  and  n4  =  Voa/3/(87tGpeRe)  =  s  — >  0 


d2y  =  1 

dt2  (1  +  n2y)2 


v(0)  =  1  /(0)  =  -e 


(iii)  Starting  with  773  =  1,  we  get  L  =  2,  then  774  =  1  sets  T  =  2/ Vo  and 
77i  =  87tGpeRe/(3V?)  =  e  -*  0 


d2y  _  s 

dt2  ~  (1  +  n2y)2 


v(0)  =  1  /(0)  =  —  1 


For  (iv)  the  scaled  problem  is 


d^y 

dt2 


ni 


t y  +  n2)2 

gmet2 


L3 


n2  = 


>40)  =  n3  /( 0) 

Re 


L 


2 

n3  =  - 

L 


=  -nA 


n4  = 


V0T 

L 


Setting  n3  =  n4  =  1  yields  L  =  2  and  T  =  2/Vq  and  n2  =  RE/2  =  e  — >  0 


d2y 


dt2  (y  +  e)2 


>'(0)  =  1  >4(0)  =  -1 
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4.2  Defining  the  nondimensionalized  solution  as  X  =  L x(t)  with  T  =  T7,  we  can 
write  the  scaled  problem  as 


x'  +  x  =  U\  sin(/720>  v(0)  =  1,  xr(0)  =  77 3 

with  L  =  Xo  being  an  imposed  scale  set  by  the  IC  and  derived  timescale  T  =  ^/M/K 
being  the  inverse  of  the  natural  frequency  coo  =  a/K/M  and 


72 

n2  =  —, 

0)Q 


0V0 


4.3  The  choice  of  scalings  L  =  B/VKM,  T  =  M/B  yields  the  nondimensional 
problem 


x"  +  xr  +  xJ  =  TJ\  sin(/720,  v(0)  =  773,  U(0)  =  774, 


with  parameters 


nx  = 


fk1/2m3/2 

B3 


M  AK1/2M1/2 

n2  =  co~,  =  — gi — 


774  = 


CK1/2M3/'2 

B2 


4.5  (a)  After  some  algebra,  we  get  the  scalings 


DE2  E  E2 

X  —  _  y  =  _  7  —  _ 

CFH’  F’  FH’ 


T  = 


DE 

BCH’ 


then  the  dimensionless  parameters  follow  as 


a  = 


AF 

BE’ 


BCH2 

Q2E2’ 


Y  = 


BC 

DE’ 


8  = 


3GE 


and  /i  =  Xq/X,  a  =  Yq/Y,  co  =  Zq/Z 

(b)  x'  =  a  —  y,  Py'  =  x  —  z  and  0  =  y  —  y2  +  ^8y3  —  z.  Solving  the  last  equation 
for  z  =f(y)  yields  a  phase  plane  system  for  (. x ,  y) 


2  1  O  3 


x  =  a-y  fiy  =x-y  +  y  -  ~8y 


A  mismatch  will  occur  unless  the  initial  conditions  satisfy  co  =f(cr). 

(c)  x'  =  a  —  y,  0  =  v  —  z,  yz'  =  y  —  y2  +  ^<$y3  —  z-  Using  the  second  equation,  the 
other  two  reduce  to 


x'  =  a  —  y  fix' =  y  -  y2  +  ^Sy3  -  x. 
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Equating  the  two  expressions  for  x!  yields  y  (a  —  y)  =  y  —  y2  +  ^8y3  —  x .  Finally, 
implicitly  differentiating  and  using  the  equation  for  x'  yields 

/  =  -y 

1  +  y  —  2y  +  8y 2 

A  mismatch  will  occur  unless  the  initial  conditions  satisfy  co  =  /x. 

4.7  The  final  nondimensionalized  system  is 

1 

hf  +  hux  +  uhx  —  0  Uf  T  uux  4-  — yhx  —  0 

Frz 


where  the  Froude  number  is  defined  as  Fr  =  U/y/gH. 

4.9  (a)  Fet  5  =  W/L  then  nx  =  4(1  +  <$)(  1  +  1/8). 

(b)  77i  =  7t((3(1  +  5)  -  V(3  +  S)(1  +  3S))(3(1  +  \/8)  -  V(3/S  +  1)(1/S  +  3). 
77p ellipse  >  F7prect  if  the  aspect  ratio  is  sufficiently  large  or  small. 

(c)  Fet  x  =  772,  y  =  77 3  and  z  =  1/77^  for  a  more  convenient  calculations:  z  = 

g(x,  y)  =  —  x)(^  —  y)(x  +  y  —  \).  Using  multivariable  calculus,  determine  the 

critical  points  of  g:  gx(x,y)  =  gy(x,y)  =  0  yielding  (x,y)  =  (0,  0) 

or  (^,  ^).  Of  these  possibilities,  (^,  ^)  maximises  g  =  1/432  and  hence  minimises 

77i  tri  >  12^3.  Note  that  (12v/3  ^  20.78)  >  (12.56  ~  47r)  as  is  expected  from  the 
result  that  the  circle  minimises  the  perimeter- to-area  ratio,  77i  circ  =  47r. 

4.11  77i  =  AT/B,  772  =  AC/B2,  773  =  A2D/B3,  T  =  (B/A)/(AC/B2,  A2D/B3) 

4.12  (a)  The  equations  for  the  dimensional  exponents  for  cancelling  out  units  in  the 
77’s  are: 

A  —  3B  —  C  +  E  +  F  =  0  [m] 

B  +  C  +  D  =  0  [kg] 

-2A  -  C  -  2D  -  F  =  0  [s] 

These  three  equations  are  linearly  independent  (as  can  be  seen  by  reducing  them  to 
echelon  form),  so  r  =  3.  There  are  6  given  quantities  and  3  base  units  (r  =  3),  so 
we  get  n  —  r  =  6  —  3  =  3  free  parameters. 

Chapters 

5.1  Using  the  rescaled  solution  (5.1),  we  can  rewrite  Burgers’  equation  as 

/UT\  fT\ 

ut  +  I  -j—  I  UUX  =  I  -  I  KUXX. 

To  make  this  scale  invariant,  the  coefficient  factors  need  to  be  set  to  one.  The  nor¬ 
malising  the  first  yields  U  =  L/T.  Normalising  the  second  yields  L  =  T1/2  and 


280 


Solutions  to  Selected  Problems 


subsequently  U  =  T“1//2.  A  scale-invariant  similarity  variable  can  be  obtained  from 
TJ\  =  LTC  =  T1/2T1/2  =  T°,  hence  c  =  1/2  and  hence  determines  rj  =  xt~ 1//2. 
Likewise  a  scale-invariant  similarity  function  is  determined  by  TI2  =  UT^  = 
y-i/2-pi  _  yO^  jience  J  —  1/2  and  f(jj)  =  tl/2u  yielding  the  similarity  solution 
form  u(x ,  t)  =  Substituting  this  into  Burgers’  equation  yields 


1 

2 


=  K 


2  ' 


5.2  Applying  (5. 1)— (5.3)  and  the  integral  yields 


Making  the  PDE  scale  invariant  selects  the  scaling  relation  U  =  L/T.  Then  making 
the  integral  scale-invariant  determines  L  =  T2//3  and  consequently  U  =  T_1//3. 
Further,  this  determines  the  similarity  variable  r\  =  x/t 2//3  and  the  similarity  solution 
as  u(x,  t)  =  Substituting  this  into  the  inviscid  Burgers  equation  yields  the 

ODE  for/(?7), 


1 

3 


dr]  =  1. 


The  similarity  solution  likewise  reduces  the  generalised  Burgers  equation,  u(ut  + 
uux)  =  kuxx,  to  the  ODE 


5.3  (a,  b)  Applying  (5.1)  to  the  PDE  yields 


du 


cr+1u  Vk4. 


To  make  this  equation  scale  invariant,  we  need  U  =  L/T  (from  the  first  coefficient) 
andTcr+1U3  =  1  in  the  second  one.  The  boundary  condition  must  be  scale  invariant 
as  well, 


u( 0,  t)  = 


*  U  =  T3, 


consequently  L  =  T4  (from  L  =  UT).  Satisfying  the  second  condition  for  the  scale 
invariance  of  the  PDE,  Ta+1U3  =  Ta+1T9  =  T°  determines  a  =  —10. 
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(c)  The  scale-invariant  similarity  variable  is  r\  —  x/tA  and  the  form  of  the  similarity 
solution  is  u(x,  t)  =  t3f(r]).  Substituting  this  form  into  the  PDE  reduces  it  to  the 


ODE, 


3/ -4/7 


d[_ 

dr) 


5.4  Satisfying  scale-invariance  in  the  heat  equation  determines  L  =  T1/2  and  the 
similarity  variable  r]  =  x/t1^2. 

(a)  The  boundary  condition  will  be  scale  invariant  for  U  =  T2.  This  yields  the  form 
of  the  similarity  solution  as  u(x,  t)  =  t2f(r]).  Substituting  in  the  PDE  yields  the  ODE 
problem 

2 f  ~  \r)f  =f",  f(0)  =  h  f(ri  -*  oo)  0. 


(b)  Here  the  boundary  condition  will  be  scale-invariant  if  U  =  L  =  T1/2  yielding 
the  similarity  solution  u(x,  t)  =  tl^2f(r]),  satisfying  the  ODE  problem, 

W  -  rtf')  =/",  /'( 0)  =  2,  — >  oo)  — >  0. 


(c)  Here  the  boundary  condition  will  be  scale-invariant  ifU  =  l/L  =  T  l!2  yielding 
the  similarity  solution  u(x ,  t)  =  t~l^2f(r]),  satisfying  the  ODE  problem, 

-\(f  +  nf)  =/",  /'( 0)  =/(  0)2,  fir, i  ->  00)  ->  0. 

(d)  Here  the  integral  condition  will  be  scale-invariant  if  L3U  =  1,  or  U  =  T-3/2 
yielding  the  similarity  solution  u(x,  t)  =  t~3^2f{rj),  satisfying  the  ODE  problem, 


r  oo 

4(3 f  +  r1f')=f",  /  r12fdr1=l,  f(r,^  oo)  ^  0. 

Jo 

(e)  Here  L  =  T  =  1  but  U  is  a  free  parameter.  The  solution  can  be  written  as 
u(x ,  t)  =  Cete~x ,  which  is  actually  a  travelling  wave,  u  =  Ce~ix^r\ 

5.5  Applying  (5.1)  to  the  PDE  yields 


du 

Yt 


The  two  coefficients  must  be  normalised  in  order  to  make  the  PDE  scale-invariant. 
The  first  coefficient  determines  L  =  T1/2  and  the  second  sets  U  =  T-1/3.  The  scale- 
invariant  similarity  variable  is  rj  =  x/t 1//2  and  the  form  of  the  similarity  solution  is 
u(x,  t)  =  satisfying  the  ODE 

-\f  -W  =f"  +/4- 
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5.6  (a)  Applying  (5.1)  to  the  PDE  yields 

du  _  /U3T\  3  /  3 du\  /UT\  3  /  d3u\ 

3 1  \  L2  /  dx  \  dx)  \  L4  /  dx  \  dx 3  / 

The  two  coefficients  must  be  normalised  in  order  to  make  the  PDE  scale-invariant. 
The  first  coefficient  determines  L2  =  U3T;  substituting  this  into  L4  =  UT  determines 
U  =  T-1/5  and  subsequently  L  =  T1/5 .  The  scale-invariant  similarity  variable  is  then 
rj  =  x/t1/ 5  and  the  form  of  the  similarity  solution  is  u(x,  t)  =  satisfying 

the  ODE 

+  =  -  iff'")'- 

Note  that  the  form  and  scaling  of  the  similarity  solution  has  already  been  determined, 
the  additional  integral  condition  happens  to  be  consistent,  J  f  dr]  =  1. 

(b)  Applying 

u(x,  t )  =  U u(x,  t),  x  =  Lx,  t  =  tc  +  T? 

to  the  PDE  yields  the  same  scalings,  U  =  T-1/5  and  L  =  T1/5 .  However  the  similarity 
variable  and  solution  now  take  a  modified  form, 

r]  =  —[  ,5  u(x,t)  =  (tc-t)~l/5f(r]), 

and  the  similarity  function  satisfies  the  modified  ODE 

\(f  +  nf)  =  -( M  -  ($"')'■ 

In  part  (a),  the  similarity  solution  evolves  to  a  limiting  behaviour  as  T  — >  oo(f— >  oo), 
while  for  this  finite-time  blow-up  case,  divergence  occurs  as  T  — >  0  (t  — >  tc). 

5.8  The  similarity  solutions  are 


h(x,t)  =  t  l/2f(xt  3/8),  u(x,t)  =  t  5^g(xt  3/8). 


Chapter  6 

6.2  At  leading  order  it  is  straightforward  to  identify  <$o  =  1  and  xq  =  3  as  a  triple  root 
of  (vo  —  3) 3  =  0.  The  next  iteration,  v  ~  3 +8\x\  yields  53v3  =  216s  hence  8\  = 
and  x\  is  given  by  the  one  of  the  three  cube  roots  of  216,  namely  x\  =  ()el2nk^  for 
k  =  0,  1,  2.  Going  on,  we  will  get  82  =  s2^3  and 

*  -  3  +  6£1/3  +  8£2/3,  jc  ~  3  —  (3  ±  /3V3)£1/3  -  (4  =f  /4V3)£2/3 

6.3  Observe  that  setting  s  =  0  in  the  equation  yields  a  contradiction  (‘60  =  O’), 
hence  the  solutions  must  be  singular  in  order  to  yields  valid  balances.  Let  v  =  8X  to 
yield 
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e6(53X3  —  5ej5^X^  —  20e5X  +  =  0. 

(i)  (2)  (3)  (4) 

Consider  the  different  possibilities  for  dominant  balances  to  determine  the  distin¬ 
guished  limits  yielding  solutions.  This  is  a  third  order  polynomial,  so  there  must  be 
exactly  three  roots  that  must  represented  within  the  set  of  distinguished  limits. 
These  distinguished  limits  are  given  by 

•  Balancing  terms  (3,  4):  8  =  £_1,  v  ~  3/s 

•  Balancing  terms  (2,  3):  8  =  s~2,  x  ~  —4 /s1 

•  Balancing  terms  (1,  2):  8  =  £-3,  v  ~  5/e3 

6.5  (a)  The  solution  is  given  by 

v(0  ~  — 2^  —  20 £t  ~  J60£  t  ' 

(b)  Note  that  after  factoring  out  t 2,  the  magnitude  of  the  terms  follows 

0(1)  »  0(st3)  »  0(s2t6) 

with  the  ordering  being  preserved  if  st3  1,  namely  0  <  t  0(£-1/3). 

6.6  Observe  that  setting  s  =  0  in  the  equations  yields  —y=  l  and  y  =  4,  hence 
we  get  a  contradiction  (y  is  overdetermined).  Rescale  each  variable  independently, 
x  =  <$(e)X  and  y  =  a(s)Y  and  carry  out  dominant  balance  of  the  system  to  obtain 
8  =  s~ 1  (from  a  dominant  balance  in  the  first  equation)  and  a  =  £°  (from  a  dominant 
balance  in  the  second  equation).  Subsequently, 

1 

v  ~  -(5  —  5s)  y  ~  4  —  5s. 

£ 

6.7  Taking  the  logarithm  of  the  equation,  we  get 

ln(2)  +  2  ln(x)  -  5x  =  3  ln(2)  +  ln(e) 

It  is  easy  to  identify  ln(e)  — >►  —  oo  as  the  dominant  term  on  the  righthand  side.  Now, 
substitute  x  =  <$o*o  with  the  assumption  that  <$o  — >  oo, 

ln(2)  +  21n(5o)  +  21n(vo)  —  55o*o  =  3  ln(2)  +  ln(£) 

Since  vo  =  0(1),  so  is  its  log.  Since  z  ln(z)  as  z  — >  oo  the  5<$o*o  term  the 
largest  term  on  the  left-hand  side  of  the  equation.  Hence  we  get  that  vo  =  1/5 
and  5o  =  —  ln(e).  Note  that  we  have  put  the  negative  sign  in  <$o  rather  than  in  the 
coefficient  so  that  the  gauge  function  is  positive  (ln(z)  — >  —  oo  for  z  — >  0).  In  general 
for  dealing  with  logarithms,  it  may  be  better  to  write  them  as  <$o  =  ln(^).  The  next 
iteration,  x  ~  ^o-^o  +  <$i*i  (with  8\)  yields  a  dominant  balance  between  terms 
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coming  from  the  original  2ln(x)  and  5x  terms  to  determine  8\  =  ln(ln(^))  and  the 

c 

solution  as 

sln<f)  +  §ln(ln<4))- 

6.10  (a)  Substituting  h  =  1  +  srj  and  u  =  1  +  sv  into  the  PDEs  and  then  collecting 
0(s )  terms  yields  the  linearised  system 


Vt  +  vx  +  Vx  —  0  vt  +  vx  +  Fr  2rjx  —  0 


(b)  We  can  then  write  the  system  in  the  form 


d 

dt 


(y)  +  (Fr-2l)^(v)-(o) 


M 


Consequently  the  determinant  condition  for  the  wavespeeds  is 


|Mr  —  XI\  = 


1  -  X  Fr~2 
1  1  —  X 


=  0 


4>  (1  -  X)2  =  Fr-2 


and  hence  the  waves  generated  will  have  speeds 


A  =  1  ±  Fr 


and  if  Fr  >  1  then  X±  >  0  and  all  waves  will  move  to  the  right  (downstream, 
subcritical),  while  if  Fr  <  1  then  X±  ^  0  and  the  two  classes  of  waves  move  in 
opposite  directions. 

Chapter  7 

7.3  Noting  that  the  inhomogeneous  boundary  conditions  are  0(1),  the  dominant  is 
finite  and  the  coefficient  3  <  (4  —  x2)  <  4  is  bounded  on  the  domain,  we  expect  the 
solution  to  be  finite  and  bounded,  hence  we  will  assume  /3  =  0  and  seek  solutions 
of  the  form  y  =  Y(X).  Fetting  X  =  (x  —  x*)/sa  for  a  >  0,  equivalently  we  have 
x  =  x*  +  saX.  The  inhomogeneous  term  then  becomes 

COS(yV)  =  COS(^V*  +  £a?~X) 

There  are  two  cases  that  need  to  be  considered  for  the  scaling  of  this  term  with  respect 
to  the  position  of  the  boundary  layer: 

> 


x*  =  0 
v*  =  1 


cos(^v)  =  COS(£ajX)  ~  1 
cos(^v)  =  —  sin(£ajX)  ~  —£a^X 
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To  determine  the  dominant  balances,  examine 


g,  1  —  2(1  y7/ 


(1) 


+  e°(4 


-xjyy  =  cos(|x), 

(2)  (3) 


where  term  (3)  is  either  (9(£°)  or  0(sa)  as  described  above. 

For  a  =  0,  we  recover  the  outer  distinguished  limit  which  yields  the  leading  order 
outer  solution 


yoO)  = 


COS(yV) 

4  —  X2 


Note  that  this  solution  does  not  satisfy  either  boundary  condition  (with  yo  (0)  =  —  1  / 4 
and  yo(l)  =  0,  so  we  will  need  boundary  layers  at  both  v*  =  0  and  v*  =  1.  First 
consider  the  case  that  the  boundary  layer  is  on  the  left  edge  of  the  domain,  v*  =  0, 
making  term  (3)  be  0(1).  Then  the  dominant  balance  for  the  singular  distinguished 
limit  is  between  terms  (1,  2)  yielding  a  =  Term  (3)  is  of  the  same  order,  0(1), 
and  hence  also  takes  part  in  the  dominant  balance.  The  resulting  leading  order  inner 
problem  fory  =  Y(X)  withX  =  x/s1^2  is 


Yq  —  4Tq  =  1,  *b(0)  =  -l 


3 

4 


e 


-2X 


1 

4’ 


where  we  note  that  we  have  eliminated  an  un-matchable  exponentially  growing  term 
for  X  — >►  oo.  Applying  the  matching  condition  between  this  inner  solution  and  the 
outer  solution, 


lim  Yr\  ~  lim  yn 

X^oo  x^O 


1 


which  fortunately  yields  consistency  and  gives  the  overlap  as  —1/4. 

For  the  boundary  layer  at  v*  =  1,  term  (3)  is  0(sa).  The  singular  distinguished 
limit  is  still  a  =  \  but  now  only  terms  (1,2)  are  involved  in  the  dominant  balance, 
yielding  the  leading  order  inner  problem  for  y  =  Y (X)  with  X  =  (x  —  1)/£1//2, 

Yq  —  3F0  =  0,  *o(0)  =  2  =►  Y§  =  2e^x, 


and  again,  an  unmatchable  exponentially  growing  term  (for  X  — >►  —  oo)  has  be 
excluded.  Here,  applying  the  matching  condition  between  this  inner  solution  and  the 
outer  solution, 

D 

lim  T0  =  lim  yo  =  0, 

X — y — oo  x—>  1 


again  yielding  consistency  of  the  construction  process,  here  with  zero  overlap. 

Forming  the  left  and  right  boundary  layer  corrections  by  subtracting  the  respective 
matching  overlaps  from  their  inner  solutions,  we  can  write  the  composite  solution 
(7.22)  as 
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^  cos (2*)  _  3^_2 x/jz  2ey3M*-i) 

4  —  x2  4 

7.4  (a)  Given  that  the  boundary  occurs  at  v*  =  0,  the  singular  solution  will  have  the 
form  y  =  s^Y(X)  with  X  =  x/sa  and  a  >  0.  The  homogeneous  initial  condition 
provides  no  information.  The  second  initial  condition  gives 


/(0)  = 


*  ep~aY'(0)  =  4s~2, 


hence  7'(0)  =  0  and  /3  —  a  =  —2.  Using  this  in  the  ODE,  we  get 


e - 1  -a y//  _l_  2e~2Y^  —  6e ~2+a Y  =  5e“X 

(i)  (2)  (3)  (4) 


It  can  be  shown  that  the  only  consistent  singular  dominant  balance  is  between  terms 
(1,2),  yielding  a  =  1,  hence  ft  =  —  1.  Consequently,  the  inner  problem  is 


Y "  +  27'  -  6sY  =  5s^X,  7(0)  =  0,  7'(0)  =  4, 


which  reduces  to  the  leading  order  problem 


7q  +  27'  =  0 


0 


Y0(X)  =  2(1  -  e~2x), 


satisfying  both  initial  conditions. 

(b)  Applying  the  matching  condition, 

lim  s^Yilmer(X) 

X^oo 


lim  s^youtel(x), 

x— >0 


we  get  from  the  inner  solution  that  the  limit  is  2/s  hence  the  outer  solution  must  be 
scaled  by  pouter  =  —  1  and  satisfy  initial  condition  yOuter(0)  =  2. 

(c)  Using  =  —  1 ,  a  =  0  for  the  scaling  of  the  outer  solution  yields  the  equation  for 

y  —  Jouter 


s°y"  +  2 e  Y  -6s  ly  =  5v,  y(0)  =  2 


-1  / 


-1 


which  at  leading  order  reduces  to 


2jo  -6y0  =  0  y0(x )  =  2e 


3x 


Consequently,  we  can  construct  the  composite  solution  from  the  outer  solution  plus 
the  inner  solution  minus  the  overlap  as 


2  3X  2  / 
y  ~  -e5  H —  ( 1 
s  s  V 


p—^x/£\  _  ?  =  ?  p-2 x/e 


/  =  -  le 

/  s  s 


) 
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7.5  (a)  The  leading  order  outer  solution  is  given  by 

2/  =  eyn  =  0  ==>■  y0(x)  =  -  In  (\(x  -  c))  , 

where  the  choice  of  the  constant  of  integration  depends  on  which  boundary  condition 
applies  to  the  outer  solution. 

(b)  Assuming  the  solution  to  be  finite,  y  =  0(1),  hence  /3  =  0,  dominant  balances 
are  determined  by 

gl— 2ttr"  :  +  2s~aY' ,-g°eF  =  0. 

(1)  (2)  (3) 

The  outer  distinguished  limit  is  obtained  by  balancing  terms  (2,3)  at  a 
singular  distinguished  limited  follows  from  balancing  terms  (1,2)  at  a 
leading  order  inner  problem  and  its  general  solution  is 

7o  +  2Fo  =  0  =►  Yo(X)  =  Cl  +  c2e~2x. 

As  yet  we  have  not  determined  the  position  of  the  boundary  layer  (x*),  but  from  the 
form  of  To  (2Q  we  can  see  that  only  the  choice  v*  =  0  can  yield  a  nontrivial  inner 
solution.  Consequently,  applying  the  boundary  condition 

F0(0)  =0  =►  Y0(X)  =  Ci(l  -  e~2x). 

(c)  Meanwhile  we  know  that  the  other  boundary  condition,  y(l)  =  0  must  apply  to 
the  outer  solution  (since  there  is  no  boundary  possible  at  v*  =  1),  hence  the  final 
form  of  the  outer  solution  becomes 

yo(x)  =  -  In  (2O+  1)) . 

Asymptotic  matching  to  the  inner  solution  then  determines  that  C\  =  In  (2).  Conse¬ 
quently  the  leading  order  composite  solution  is 

j - In  (i(x  +  1))  -  ln(2)e-2jc/£. 

(d)  Expanding  the  outer  solution  as  y  ~  yo  +  eyi ,  the  exponential  term  can  be  written 
as 

^  jo+m  =  eyoesyi  „  eyo  (X  +  eyQ  +  l£2y2  +  . . .  \ 


=  0.  The 
=  1.  The 
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Collecting  O(s)  terms  in  the  ODE,  the  equation  foryi  is  then 


y'o  +2/i  +yieyo  =0 


> 


y  iW  = 


In  ( \  (x  +  1)) 
2(x  +  1) 


7.7  (a)  The  outer  solution  is  given  by 


y 


zb  a/ 4  —  x2  + 


£ 

2(4  -  x2) ' 


(b)  At  v*  =  2,  the  limiting  behaviour  of  the  outer  solution  for  v  — >  2  determines 
that  p  =  and  the  dominant  balance  of  terms  in  the  ODE  determines  the  singular 

9  1 

distinguished  limit  as  a  =  | ^  with  the  inner  problem  being 


4X  +  s2/3X2  +  Y2  =  -Y' 


X  = 


x  —  2 
727T- 


At  v*  =  0,  there  is  one  inner  solution  that  must  be  matchable  to  the  outer  solution, 
y(x  — >  0)  ~  ±2  =  D(l),  hence  it  must  have  /3  =  0.  A  dominant  balance  in  the  ODE 
yields  a  =  1  with  the  inner  problem 


e2X2  +  Y2  =  4  —  Y',  X  = 

£ 


At  x*  =0  there  is  a  different  inner  solution  that  satisfies  the  initial  condition  y  (0)  = 
3/s,  hence  having  /3  =  —  E  Determination  of  the  dominant  balance  with  this  value 
of  p  yields  a  =  2  and  the  ODE 

s6X2  +  Y2  =  4s2  -  Y'  X  = 

since  this  solution  exists  on  a  narrower  domain  (e2  <<C 
an  inner-inner  solution. 

Chapter  8 

8.1  (a)  Using  the  method  of  separation  of  variables,  we  get  (8.8), 


v 

£z 

£),  this  is  sometimes  called 


oo 

U(X,  Y)  =  ^  C„  sinh(^Y)  sin(^X) 
//— 1 
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Evaluating  the  boundary  condition  along  Y  =  H  yields 


91) 

9Y 


oo 


V  (^L  cosh(^H))  sin(^fX)  =  e"3X/L 

Y=H  B=i V  L  ’ 


and  the  Fourier  sine  series  determines  the  final  form  of  the  solution  as 


U(X,  Y) 


sinh(^fY)  sin(^fX) 


(b)  The  leading  order  outer  problem,  scaled  with  U  =  1  m  is 

r\ 

U0yy  =  0,  uo(x,  0)  =  0,  U0y  =  Y\e~ 
yielding  uo(x,y )  =  H e~3xy. 

(c)  Substituting  X  =  Lx,  Y  =  Hy  and  taking  the  s  — >►  0  limit  of  the  separation  of 
variables  solution  yields  U  =  (Fourier  sine  series  of  e  ^x)Y\ y. 

8.2  (a)  The  leading  order  outer  problem  is 


UQyy  =  0,  uo(x,  0)  =  sin(57rv),  uo(x ,  1)  =  cos(3ttx) 
yielding  uq(x,  y)  =  (cos(3ttx)  —  sin(57rv))y  +  sin(57rv). 

(b)  Since  limx^o  uo(x,  y)  =  y,  the  boundary  layer  correction  at  x*  =0  can  be 
expressed  as  VL(X,  y)  =  U(x,  y)—  y  withFL(0,  y)  =  sin(^7ry)—y  and  homogeneous 
Dirichlet  boundary  conditions  on  rest  of  the  boundary  of  the  semi-infinite  strip, 
0  <  y  <  1,  x  >  0.  Then 


VL(XL,  y)  =  -  2  V  (  1)n  e~,mxL  sin(«7ry)  XL  =  ->0. 

(An1  —  l)nn  s 

n=  1 

Similarly,  the  right  boundary  layer  correction  can  be  determined  to  be 

O  _  1 

VR(XR ,  y)  =  Yj  ——3  -e(lk+l)nxR  sin((2A:  +  1  )ny)  XR  = -  <  0. 

y£K  “I-  1 JT  G 


(d)  The  nondimensional  form  of  the  flux  is 


R  1  Pl 

JR  =  - 


oo 


ux(l,y)dy  ~  i  [  5n(y  -  1)  +  -  , 

L  Jo  L  J0  ° 


1  8  sin((2k  +  l)7ry) 

>  - dy. 


s  (2k  +  1)27T2 

k=o  v  ' 
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The  value  of  the  flux  is  dominated  by  the  0(\/s)  contribution  from  the  boundary 
layer  correction 


JR  - 


1 

sL 


oo 


z 

k=0 


16 


16 


(2k  +  1)37T3  H7T3 


(1.0518) 


0.543 

H 


8.3  Let  f(x)  =  F(X).  (a)  The  scaled  form  of  the  no-flux  condition  on  the  top 
boundary  is 


uy  +  £“157r  sin(3Ttx)ux  =  0  aty  =f(x )  =  15  +  5cos(37rv). 
(b)  The  0(1)  outer  problem  is 


U0 yy  =  0  U0y(X ,  0)  =  0  U0y(X,f(x))  =  0 

The  O(s)  outer  problem  is 

UOxx  +  U\yy  =  0  U\y(x,  0)  =  0  U\y(x,f)  +  1571  sin (3n  x) U0x (x , f)  =  0 
yielding  the  solutions 

u0(x,y)  =  C2(x),  u i (x,  y)  =  - \ C'J ( x)y 2  +  C4 (x) 
where  the  top  boundary  condition  gives  the  compatibility  condition 


d  (  dC2  . 

—  ( (15  +  5  cos(3ttjc))  —1=0 
dx  \  dx 


) 


(d)  Applying  matching  of  the  outer  solution  to  the  boundary  layers  implies  zeroing  the 
n  =  0  coefficients  in  the  cosine  series  expansions  of  the  boundary  layer  corrections 
yields  the  boundary  conditions  on  C2(x), 


CO  = 


do  = 


i  r20  r  y3 


20  jo 
1  f10- 

10  Jo 


100 


-  c2(0) 


dy  =  0 


3y2  -  C2(l) 


dy  =  0 


* 


C2(0)  =  -20, 


*  C2(l)  =  100. 


8.5  (a)  The  0(1)  problem  is 


00 yy  —  0,  (f)0y(x ,  0)  —  0 


r 


>  0o  (x,y)  =  Co(x,t) 


The  0(s2)  problem  is 


01  yy  H"  *p0xx  —  01y(v,  0)  —  0 


>  01  (x,y)  =  ~jC0xx(x,  t)  +  C\(x,  t). 
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The  0(s4)  problem  is 

4>2yy  "b  01  xx  =  +  4>2y(Xi  0)  —  0  —  ^  01  (%•>  j)  —  24  ^Oxxxx U 5  0  Qz (+  0 . 


(b,  c)  See  Exercise  9. 12. 

Chapter  9 

9.1  Using  the  initial  conditions,  vo(?)  =  cos(?)  +  2  sin(?).  Using  the  O(s)  equation 
with  xo(t)  specified  and  the  initial  conditions  ;q(0)  =  —  1,^(0)  =  1,  x\(t)  = 
[115  sin(?)  —  12  cos(?)]  +  [60?  sin(?)  —  120?  cos(?)]  +  [11  cos(3?)  +  2  sin(3?)]  with 
the  homogeneous,  secular  and  non-resonant  respond  terms  grouped  respectively. 

9.2  (a)  vo  =  acosO  with  6  ~  (2  +  )t  works  for  any  a  >  0.  (b)  vo  =  acosO 
with  coo  =  3,  co i  =  0  satisfies  the  problem  only  for  a  =  2. 

9.3  (a)  The  amplitude  equations  are 


dA 

dr 


\A-^{A2+B2), 
z  8 


dB  i  B  ^  o 

—  =  \B - (£2+A2) 

dr  2  8 


(b)  Note  that  =  2A^  +  2B^  =2R^  .  Expanding  out  2 yields 


dR 

dr 


5^(1  -  \R 2) 


which  has  R*  =  2  as  an  equilibrium.  Note  that  this  matches  the  periodic  solution 
determined  in  Exercise 9.2(b),  but  now,  using  MMTS,  we  have  a  prediction  for  rate 
of  convergence  to  the  limit  cycle. 

9.5  (b)  Suppressing  the  e±lt  resonant  forcing  terms  in  the  O(s)  equation  for  X\  yields 

—  =  -\pc  +  -ia|C|2C  +  \eiY\ 
dr  2  2  2 

with  the  second  solvability  condition  being  exactly  the  complex  conjugate  of  the 
above  equation  (and  hence  yielding  no  independent  information). 

(c)  Substituting  into  the  amplitude  ODE  for  C  reduces  to  the  algebraic  equation 


i  3  9  1 

y  =  li/3  H — aM2 - e 

y  2  H  2  4  M 


■id 
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Separating  real  and  imaginary  parts  of  this  equation  yields 


cos  6 
AM  ’ 


sin# 

4M~' 


For  fixed  a,  /,  expressing  sin  6  =  ±V  1  —  cos2  0,  we  can  solve  the  second  equation 
to  get  cos  0  =  ±a/  1  —  A/32M2.  Substituting  this  into  the  first  equation  gives  the  final 
(real- valued)  form  of  the  detuning  relation: 


y(M)  =  \aM 2  ± 


sfx  -4y62M2 
AM 


Entrained  solutions  exist  up  to  a  critical  amplitude  set  by  the  damping,  M  <  Mc  = 
1  / (2)0).  Noting  that  the  terms/  =  xA-sax3  can  be  interpreted  as  a  restoring  force  due 
to  a  spring,  the  case  of  a  >  0  is  called  a  hardening  spring  since  the  restoring  force 
is  stronger  than  for  the  corresponding  linear  Hooke’s  law  spring  [54].  Conversely, 
the  case  a  <  0  is  called  a  softening  spring.  In  contrast  to  the  case  of  a  linear  oscil¬ 
lator  near  resonance  (see  Fig.  4.1),  for  a  7^  0,  the  amplitude  of  the  forced  solution 
is  shifted  away  from  the  oscillator’s  natural  frequency  by  the  amplitude-dependent 
detuning,  y(M).  For  hardening  (softening)  springs,  the  resonant  peak  gets  shifted 
above  (below)  the  (zero-amplitude)  linear  natural  frequency.  See  below  for  a  plot 
of  the  detuning  relation  plotted  for  a  hardening  spring  case,  a  >  0  along  with  the 
(undamped)  resonant  “backbone”  curve  y  =  | ocM 2 . 


7 

9.6  The  amplitude  equations  are  ^  =  0  and  ^  =  —  yielding  the  solution 


9.7  The  amplitude  equations  are^  +  ^A  =  0,  ^  —  -jrZ?  =  0. 
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9.8  (c,  d)  The  amplitude  equation  is  determined  by  forcing  the  coefficient  of  the 
resonant  forcing  term  ( e~l4t )  in  the  equation  for  X\  to  vanish,  ^  ^ A 2  =  0, 

yielding  the  solution  v  ~  g~/4r. 

9.10  Substituting  the  expansion  into  the  ODE,  at  leading  order  we  get 

X()tt  +  Xq  =  s^  Pxl  +  cos  t. 


The  inhomogeneous  forcing  terms  will  be  higher  order  if  —1  <  <  1  to  yield 

Xo(t,  r)  =  A( r)elt  +  B(r)e~lt.  In  the  equation  for  X\,  deferring  the  resonant  terms 
to  higher  order  requires  a  —  y  >  0  and  1  +  —  y  >0  with  1  —  /3  —  y  =0  needed  to 

make  the  remaining  forcing  term  be  0(1).  Finally,  in  the  equation  for  X 2  balancing 
resonant  terms  leads  to  a  =  4/3,  /3  =  1/3,  y  =  2/3  with  amplitude  equations 


dA  1  10  9  dB  1  10  9 

-2  i—  +  -  +  — A2B  =  0  2  i—  +  -  +  —  AB2  =  0, 

dr  2  3  dr  2  3 


yielding  a  stable  limit  cycle  with  Xo(t)  =  —  (6/5) 1//3  cos  t. 


9.11  (a)  The  trivial  solution  v  =  0  is  an  equilibrium  state.  Linear  stability  analysis 
yields  X2  +  1  +  se~2X  =  0  with  0(1)  solutions  X  ~  zb/  zb  £  \ieTl\  both  of  these  have 
Re  (A)  >  0  so  the  trivial  solution  is  unstable,  (b)  The  amplitude  equations  are 


dA  1  dB  1 

- A  (A  sin  2  +  B  cos  2)  =  0, - f  A  (A  cos  2  —  B  sin  2)  =  0. 

dr  2  dr  2 

9.12  (a)  Taking  dt  of  the  first  equation  in  (9.36)  yields  Fot  =  —Cott  and  hence  Cott  = 
Coxx  from  the  second  equation.  Taking  dxx  of  the  first  equation  yields  Fqxx  =  —Cotxx 
then  using  this  in  dt  of  the  second  equation  yields  Fqu  =  Fqxx- 

(b)  The  0(1)  equations  determine  that/o  =  c$z  at  leading  order.  There  is  one  s2dT 
term  from  the  0(1)  equations  that  gets  added  to  the  0(s2)  (9.37)  equations.  The  sum 
of  dz  of  the  first  equation  plus  the  second  cancels  the  c\,f\  terms  to  yield  the  KdV 
equation  (9.38)  depending  only  on/o(z,  r). 

9.13  X(1)  ~  2  -  ±e,  A(2)  ~  -4  +  | s 

Chapter  10 

10.1  (a)  f{x)  =  —x3  +  9x  +  3x2, 

(b)  (v*,  y^)  =  (0,  0)  is  the  only  equilibrium  point;  the  eigenvalues  from  its  linear 
stability  are  X  =  (9  ±  ^/W\~^T6s) / (2s)  >  0,  so  it  is  an  unstable  node. 

(c)  The  slow  manifold  is  yo(*o)  =  (*q  —  ~  9xo)/4. 

(e)  The  slow  manifold  has  local  extrema  at  y(—  1)  =  5/4  and  y(3)  =  —27/4.  In 
the  fast  evolution  stages,  y  is  constant  and  v  evolves  from  the  extrema  value  to  the 
other  value  on  the  slow  manifold,  satisfying  y  =  S(x).  Solving  the  cubic  S(x)  =  5/4 
yields  vmax  =  5  and  solving  S(x)  =  —27/4  yields  xm[n  =  —  3. 
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(f)  Implicitly  differentiating  the  equation  for  the  slow  manifold,  j-f(y  =  S(x )),  and 
applying  ^  =  —x  yields  the  equation  for  evolution  on  the  slow  manifold, 


dx 

dt 


4x 


3x2  —  6x  —  9 


dx 

g(x) 


3x2  —  6x  —  9 


4x 


dx 


The  limit  cycle  is  composed  of  two  fast  and  two  slow  stages,  P  =  7fast,  l  +  Tsiow,2  + 
^fast,3  T  Ts1ow,4  ^  ^siow,2  T  ^siow,4-  Stage  2  starts  at  x  =  5  and  ends  at  x  =  3, 

1  f 3  9  9 

^slow,2  =  7  /  -~h6-3xdx  =  3-h-  ln(3/5). 

4  J5  x  4 

Similarly,  stage  4  starts  at  v  =  —3  and  ends  at  v  =  —  1, 


1  f-1  9  9 

^slow,4  =  7  /  -+6-3xdx  =  6~-  ln(3) 

4  J-3  x  4 


Therefore  the  period  is  P  ~  9  —  (9/4)  In (5)  ~  5.38. 

10.2  (a)  Setting  s  =  0  in  the  z!  equation  yields  the  slow  manifold  as  z  =  S(y)  = 
y  —  y2  +  ^ y 3  and  then  the  leading  order  slow  system  is 

dx  o  dyo  2  13 

—  =  2-V„  —  5,0 

The  only  equilibrium  point  of  the  slow  system  is  (x*,y*)  =  (|,  2),  which  is  a  stable 
spiral  point  with  eigenvalues  A  =  ^(—  1  ±  3/).  Consequently  the  t  — >  oo  solution 
will  be  (x,y,z)  ~  (§,  2,  |  =  5(2)). 

Note  that  the  initial  condition  is  not  on  the  slow  manifold  since  0^5(3),  therefore 
there  will  be  an  initial  layer  governed  by  the  fast  system  (for  the  distinguished  limit 
with  a  =  1  set  by  the  z'  equation), 


dX 

dT 


dY 

dT 


=  s(X-Z), 


dZ 

dT 


=  Y  —  Y2  +  jY3  —  Z 


At  leading  order,  the  initial  layer  will  have  Xo(T)  =  1,  To(^)  =  3,  constants  set  by  the 
initial  conditions.  Then  the  last  relation  is  Zq  =  3  —  Zo  yielding  Zo(T)  =  3(1  —  e~T ) 
which  connections  the  initial  condition  to  the  point  (x,y,  z)  =  (1,3,3)  on  the  slow 
manifold. 

(b)  Setting  s  =  0  in  the  /  equation  yields  the  relation  x  =  z.  Using  that  result  in 
the  x'  gives  a  second  expression  for  z'  that  could  be  matched  to  the  z!  equation  to 
yield  2  -y  =  y  —  y2-\-  ^ y 3  —  z.  The  expression  for  z  =  z(y)  is  monotone  increasing 
and  hence  could  be  inverted  to  yield  y  =  y(z)  to  describe  the  slow  manifold  as  a 
parametric  curve  in  3D.  Implicitly  differentiation  gives  motion  on  this  curve  as 
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dy  _  2-y 

dt  2  —  2y  +  y2  ’ 

which  has  a  stable  equilibrium  point  at  y*  =  2.  Consequently  the  solution  for  t  — >  oo 
will  approach  (x,y,z)  ~  (§  =  *(|),  2,  |  =  z(2)). 

Since  the  initial  condition  does  not  lie  on  the  slow  manifold,  there  will  be  an  initial 
layer  governed  by  the  fast  system, 


dX 

dT 


dY 

dT 


=  X  —  Z, 


dZ 

dT 


=  s( 


=  e  I  Y  -Y2  +  \y3  -Z 


) 


At  leading  order,  the  initial  layer  will  have  Xo(T)  =  0,  Zq(T)  =  1,  constants  set  by 
the  initial  conditions.  Then  the  last  relation  is  Fq  =  —1,  describing  linear  decrease 
of  To  from  the  initial  value  Fo(0)  =  3  until  it  reaches  the  slow  manifold,  at  z(y)  =  1 
(aty  «  2.1517). 

10.3  (a)  The  dimensional  rate  equations  are 


dA 

dJ 


d B  .  . 2  ■  ~  _  dC 


—  — 2k-|  A  +  2k2B  —  k^AB,  =  k-|  A  —  k2  B  —  kgAB, 


=  k3AB. 


(c)  The  scaled  problem  is 


da 

dt 

where 


9  db  o  dc 

■2  az  +  2rJ\b  —  Tl2ab,  s —  =  a1  —  fl\b  —  Jl2ab,  — 

dt  dt 


=  Tl2ab 


B 


£  = 


0 


A0’ 


m  = 


k2Bo 

Mg’ 


n2  = 


^Bq 
ki  Aq 


(d)  By  setting  e  =  0  in  the  br  equation,  we  get  the  slow  manifold,  b  =  S(a)  = 
a2 /(Tli  +  772^).  Substituting  the  slow  manifold  for  b  in  the  rate  equations  for  a' ,  c' 
yields 


da  3772a3  dc  772a3 

dt  Tli  +  772a  dt  T7\  +  772a  ’ 

which  is  consistent  with  the  overall  expectation  of  the  sum  c  +  ^a  being  conserved. 
Undoing  the  dimensional  scalings,  we  get 

dA  3k-|  k3A3 


d~T  k2  +  k3A 


G(A)  and  F(A)  =  ^G(A). 
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10.5  The  reactions  take  the  form 


A+B^2P  B^2C  C+A^D  +  P  D  +  B  P  +  C 


The  rate  equations  are 


dA 

df 

dC 

df 

dD 

df 


=  — k3AC  +  k4DP 


d  B 

df 


=  —  k-|  B  +  k2C  —  ksBD 


=  2k-|  B  —  2k2C  —  k3AC  +  k4DP  +  ksBD 


=  k3AC  -  k4DP  -  k5BD 


dP 

df 


=  k3AC  —  k4DP  +  ksBD 


Using  the  QSSA  applied  to  the  intermediates  sets  C'  =  D'  =  0  yielding  the  slow 
manifold-type  relations 


D- 


k3 A  /  k-|  B 


k4P  +  ksB V  k2 


Substituting  these  into  the  rate  equation  for  P’  yields 


dP  «ab3/2  /  k! 

a  =  2k3 


dT  B  +  fiA 


k2 


k4 

^5 


Chapter  1 1 

11.1  Integrals  of  the  Gaussian  yield 


*oo  poo 

p(x,t)dx  =  C i,  /  xp{x,  t)  dx  =  C1C3, 

—00  J— 00 


*00 

x2p(x,  t )  dx  =  Ci  (C2  +  2 (t  +  C2)) 


—00 


Matching  these  results  with  (11.3),  (11.4)  and  (11.7)  determines 


f  Mi  1  f 

C 1  =  Mo  =  /  f  dx,  C3  =  —  =  —  /  xf  dx,  C2  = 

M0  M0iJ  2M0 


1  r  9  m? 

—  /  x2/  dx - 3- . 

^0  J  2Mq 


11.2  (a)  Integrating  the  PDE  against  and  applying  the  boundary  conditions  yields 
(noting  that  p(|x|  — >  00)  — >►  0  implies  that  3^p(|x|  — >  00)  — >►  0) 


dMo  dM\  dM2 

=  4 M0,  - --2Af0  =  4Afi,  — -  -4Afi  =  6M0  +4M2. 


(it 


(it 


(it 


Solving  this  system  of  linear  ODEs  and  applying  the  initial  conditions,  Mo(O)  = 
\frc ,  Mi(0)  =  0,  M2(0)  =  ^  +J~tc  yields 
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M0(0  =  ,  M\(t)  =  l^/nte^ ,  M2(t)  =  | ' \t 2  +  6f  +  ^  £4r. 


(b)  Integrating  the  PDE  against  x°,  xl  and  applying  the  boundary  conditions  yields 


dMo  _  2  _  _3dP 


dt 


dx 


+  4  M0, 


x=0 


dM  i 


-  2M0  =  3  +  4Af  i 


11.3  (a)  Integrating  the  PDE  against  x°,xl  and  applying  the  boundary  conditions 
yields 


dMo 

dt 


dM  i 

— —  =  -Mi  -  (p(;r,  0  -  p( 0,  0), 
dt 


(b)  Substituting  the  separation  of  variables  form  into  the  PDE  yields  that  =  1  +  k2 
for  k  —  0,  1 ,  2, . . . .  Namely,  all  modes  decay  exponentially  since  >  0,  butAo  =  1 
decays  the  least  rapidly. 

Examining  the  initial  condition  at  t  =  0,  p(x,  0)  =f(x)  and  obtaining  the  Fourier 
cosine  series  coefficients  (see  Appendix  A)  yields 


If71  2  fn 

ao  =  —  /  f  dx,  cy/v  =  /  f  cos(kx)  dx. 

n  Jo  X  Jo 


Retaining  the  slowest-decaying  mode  from  the  series  for  p(x,  t)  then  yields 


e  r  fn 

p(x,  t)  ~  —  /  f  dx  as  t  —>  oo. 

n  Jo 

(b)  Integrating  the  PDE  against  x°,  xl  and  applying  the  boundary  conditions  (p  — >  0 
as  |jc|  — >  oo)  yields 


dMo 

dt 


dM2 

— -  =  2M0  -  M2. 
dt 


11.4  (a)  Integrating  the  PDE  directly  yields  M'0(t)  +  p(o o,  t)  —  p( 0,  t)  =  —2Mo(t), 
then  applying  the  boundary  conditions  yields 


dMo 

dt 


08  -  2 )M0. 


(b)  Integrating  the  PDE  directly  yields  the  same  initial  form  as  in  part  (a),  but  the 
new  birth  condition  yields  the  ODE 


dMo 


dt 


2  Mo  +  Mi . 
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Integrating  the  PDE  against  e  3a  similarly  yields 


dM\ 

dt 


—AM\. 


11.5  (a)  Mo  =  ffdx,M\  =  f  xf  dx. 

(b)  Note  that  the  porous  medium  equation  can  be  re-written  as  Pt  —  \  ( p4)xx  to  make 
integration  by  parts  more  convenient, 


dM2 

dt 


x2(p4)X]Cdx  =  I 


K 


x2{p4)x  ~  2 xpA 


+  2  /  p4  dx  )  >  0, 


) 


where  the  boundary  terms  vanish  due  to  the  boundary  conditions  and  the  integral  is 
positive-definite. 
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